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TAOJIKs AS MODIFIERS OF THE BETA-CATENIN PATHWAY AND 

METHODS OF USE 

REFERENCE TO RELATED APPLICATIONS 

5 This application claims priority to U.S. provisional patent application 60/340,3 12 

filed 12/13/2001. The content of the prior application is hereby incorporated in its 
entirety. 

BACKGROUND OF THE INVENTION 

10 Beta-catenin is an adherens junction protein. Adherens junctions (AJs-, also called 

the zonula adherens) are critical for the establishment and maintenance of epithelial layers, 
such as those lining organ surfaces. AJs mediate adhesion between cells, communicate a 
signal that neighboring cells are present, and anchor the actin cytoskeleton. In serving 
these roles, AJs regulate normal cell growth and behavior. At several stages of 
15 embryogenesis, wound healing, and tumor cell metastasis, cells form and leave epithelia. 
This process, which involves the disruption and reestablishment of epithelial cell-cell 
contacts, may be regulated by the disassembly and assembly of AJs. AJs may also 
function in the transmission of the 'contact inhibition' signal, which instructs cells to stop 
dividing once an epithelial sheet is complete. 
20 The AJ is a multiprotein complex assembled around calcium-regulated cell 

adhesion molecules called cadherins (Peifer, M.(1993) Science 262: 1667-1668). 
Cadherins are transmembrane proteins: the extracellular domain mediates homotypic 
adhesion with cadherins on neighboring cells, and the intracellular domain interacts with 
cytoplasmic proteins that transmit the adhesion signal and anchor the AJ to the actin 
25 cytoskeleton. These cytoplasmic proteins include the alpha-, beta-, and gamma-catenins. 
The beta-catenin protein shares 70% amino acid identity with both plakoglobin, which is 
found in desmosomes (another type of intracellular junction), and the product of the 
Drosophila segment polarity gene 'armadillo'. Armadillo is part of a multiprotein AJ 
complex in Drosophila that also includes some homologs of alpha-catenin and cadherin, 
30 and genetic studies indicate that it is required for cell adhesion and cytoskeletal integrity. 

Beta-catenin, in addition to its role as a cell adhesion component, also functions as 
a transcriptional co-activator in the Wnt signaling pathway through its interactions with 
the family of Tcf and Lef transcription factors (for a review see Polakis, (1999) Current 
Opinion in Genetics & Development, 9:15-21 and Gat U., et al., (1998) Cell 95:605-614). 
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The APC gene, which is mutant in adenomatous polyposis of the colon, is a 
negative regulator of beta-catenin signaling (Korinek, V. et al., (1997) Science 275: 1784- 
1787; Morin, P. J., et al., (1997) Science 275: 1787-1790). The APC protein normally 
binds to beta-catenin and, in combination with other proteins (including glycogen synthase 
5 kinase-3b and axin, is required for the efficient degradation of b-catenin. The regulation 
of beta-catenin is critical to the tumor suppressive effect of APC and that this regulation 
can be circumvented by mutations in either APC or beta-catenin. 

While mammals contain only a single beta-catenin gene, C elegans contains three 
(Korswagen HC, et aL, (2000) Nature 406:527-32). Each worm beta-catenin appears to 
10 carry out unique functions (Korswagen HC, et al., (2000) Nature 406:527-32, Nartarajan L 
et al. (2001) Genetics 159: 159-72). Because of the divergence of function in C. elegans, 
it is possible to specifically study beta-catenin role in cell adhesion, which is mediated by 
^ the Ci elegans beta-catenin HMP-2. 

Eukaryotic cells respond to extracellular stimuli by recruiting signal transduction 
15 pathways, many of which employ protein Ser/Thr kinases of the ERK family (Levin, D. 
E., and Errede, B. (1995) Curr. Opin. Cell Biol. 7, 197-202). The ubiquity of ERKs and 
their upstream activators, the MEKs, in signal transduction was first appreciated from 
studies of yeast (Herskowitz, I. (1995) Cell 80, 187-197). Part of the cellular response to 
toxins, physical stresses and inflammatory cytokines occurs by signalling via the stress- 
20 activated protein kinase (SAPK) and p3 8 reactivating kinase pathways (Kyriakis, J. M., 
and Avruch, J. (1990) J. Biol. Chem. 265, 17355-17363; Kyriakis, J. M., et aL, (1991) J. 
Biol. Chem. 266, 10043-10046; Pulverer, B. J., et al., (1991) Nature 353, 670-674). These 
stress-responsive kinase pathways are structurally similar, but functionally distinct, from 
the archetypal mitogen-activated protein kinases (MAPKs or ERKs). The stimuli that start 
25 the pathway result in modification of cellular gene expression, growth arrest, apoptosis, or 
: activation of immune and reticuloendothelial cells. TAOl (thousand and one amino acid) 
-, is a protein kinase that may play a role in regulating stress-responsive MAP kinase 
pathways (Hutchison, M., et al (1998). J Biol Chem 273:28625-32). KIAA1361 is a 
protein with strong similarity with TAO 1 . JIK (JNK-S APK inhibitory kinase) is an 
30 -ri STE20-like serine/threonine kinase and member of the GCK-like subfamily of Ste20 
kinases. JIK is activated by ligand-bound EGF receptors, inhibits the JNK/SAPK 
signaling pathway, and also interacts with IRE1 (a gene involved in endoplasmic 
reticulum stress response) (Tassi, E., et al (1999) J Biol Chem 274:33287-95; Zhang, W., 
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et al (2000) Biochem Biophys Res Commun 274:872-9; Yoneda, T., et al (2001) J Biol 
Chem 276:13935-40). 

The ability to manipulate the genomes of model organisms such as C. elegans 
provides a powerful means to analyze biochemical processes that, due to significant 
5 evolutionary conservation, have direct relevance to more complex vertebrate organisms. 
Due to a high level of gene and pathway conservation, the strong similarity of cellular 
processes, and the functional conservation of genes between these model organisms and 
mammals, identification of the involvement of novel genes in particular pathways and 
their functions in such model organisms can directly contribute to the understanding of the 
10 correlative pathways and methods of modulating them in mammals (see, for example, 
Dulubova I, et al, J Neurochem 2001 Apr;77(l):229-38; Cai T, et al., Diabetologia 2001 
Jan;44(l):81-8; Pasquinelli AE, et al., Nature. 2000 Nov 2;408(6808):37-8; Ivanov IP, et 
al., EMBO J 2000 Apr 17;19(8): 1907-17; Vajo Z et al., Mamm Genome 1999 
Oct;10(10): 1000-4). For example, a genetic screen can be carried out in an invertebrate 
15 model organism having underexpression (e.g. knockout) or overexpression of a gene 

(referred to as a "genetic entry point") that yields a visible phenotype. Additional genes 
are mutated in a random or targeted manner. When a gene mutation changes the original 
phenotype caused by the mutation in the genetic entry point, the gene is identified as a 
"modifier" involved in the same or overlapping pathway as the genetic entry point. When 
20 the genetic entry point is an ortholog of a human gene implicated in a disease pathway, 
such as beta-catenin, modifier genes can be identified that may be attractive candidate 
targets for novel therapeutics. 

All references cited herein, including patents, patent applications, publications, and 
sequence information in referenced Genbank identifier numbers, are incorporated herein in 
25 their entireties. 

SUMMARY OF THE INVENTION 

We have discovered genes that modify the beta-catenin pathway in C. elegans, and 
identified their human orthologs, hereinafter referred to as TAO and JIK related kinases 
30 (TAOJIK). The invention provides methods for utilizing these beta-catenin modifier 
genes and polypeptides to identify TAOJIK-modulating agents that are candidate 
therapeutic agents that can be used in the treatment of disorders associated with defective 
or impaired beta-catenin function and/or TAOJIK function. Preferred TAOJIK- 
modulating agents specifically bind to TAOJIK polypeptides and restore beta-catenin 
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function. Other preferred TAOJIK-modulating agents are nucleic acid modulators such as 
antisense oligomers and RNAi that repress TAOJIK gene expression or product activity 
by, for example, binding to and inhibiting the respective nucleic acid (i.e. DNA or 
mRNA). 

5 TAOJIK modulating agents may be evaluated by any convenient in vitro or in vivo 

assay for molecular interaction with a TAOJIK polypeptide or nucleic acid. In one 
embodiment, candidate TAOJIK modulating agents are tested with an assay system 
comprising a TAOJIK polypeptide or nucleic acid. Agents that produce a change in the 
activity of the assay system relative to controls are identified as candidate beta-catenin 

10 modulating agents. The assay system may be cell-based or cell-free. TAOJIK-modulating 
agents include TAOJIK related proteins (e.g. dominant negative mutants, and 
biotherapeutics); TAOJIK-specific antibodies; TAOJIK-specific . antisense oligomers and 
other nucleic acid modulators; and chemical agents that specifically bind to or interact 
with TAOJIK or compete with TAOJIK binding partner (e.g.-by binding to a TAOJIK 

15 binding partner). In one specific embodiment, a small molecule modulator is identified 
using a kinase assay. In specific embodiments, the screening assay system is selected 
vrifrom a binding assay, an apoptosis assay, a cell proliferation assay * an angiogenesis assay, 
and a hypoxic induction assay. 

In another embodiment, candidate beta-catenin pathway modulating agents are 

20 further tested using a second assay system that detects changes in the beta-catenin 

pathway, such as angiogenic, apoptotic, or cell proliferation changes produced by the 
originally identified candidate agent or an agent derived from the original agent. The 
second assay system may use cultured cells or non-human animals. In specific 
embodiments, the secondary assay system uses non-human animals, including animals 

25 predetermined to have a disease or disorder implicating the beta-catenin pathway, such as 
an angiogenic, apoptotic, or cell proliferation disorder (e.g. cancer); 

The invention further provides methods for modulating the TAOJIK function 
and/or the beta-catenin pathway in a mammalian cell by contacting the mammalian cell 
with an agent that specifically binds a TAOJIK polypeptide or nucleic acid. The agent 

30 may be a small molecule modulator, a nucleic acid modulator, or an antibody and may be 
administered to a mammalian animal predetermined to have a pathology associated the 
beta-catenin pathway. 
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DETAILED DESCRIPTION OF THE INVENTION 

Genetic screens were designed to identify modifiers of the beta-catenin pathway in 
C. elegwis. A weak allele of beta-catenin was used in our screen (a homozygous viable 
mutant of beta-catenin, allele qm39). The hmp-2 (qm-39) strain produces larval worms 
5 with a highly penetrant lumpy body phenotype in first stage larval worms (Lis). Various 
specific genes were silenced by RNA inhibition (RNAi). Methods for using RNAi to 
silence genes in C. elegans are known in the art (Fire A, et aL, 1998 Nature 391:806- 
811; Fire, A. Trends Genet. 15, 358-363 (1999); W09932619). Genes causing altered 
phenotypes in the worms were identified as modifiers of the beta-catenin pathway. A 

10 modifier of particular interest was T17E9.1. Accordingly, vertebrate orthologs of these 
modifiers, and preferably the human orthologs, TAOJIK genes (i.e., nucleic acids and 
polypeptides) are attractive drug targets for the treatment of pathologies associated with a 
defective beta-catenin signaling pathway, such as cancer. 

In vitro and in vivo methods of assessing TAOJIK function are provided herein. 

15 Modulation of the TAOJIK or their respective binding partners is useful for understanding 
the association of the beta-catenin pathway and its members in normal and disease 
conditions and for developing diagnostics and therapeutic modalities for beta-catenin 
related pathologies. TAOJIK-modulating agents that act by inhibiting or enhancing 
TAOJIK expression, directly or indirectly, for example, by affecting a TAOJIK function 

20 such as enzymatic (e.g., catalytic) or binding activity, can be identified using methods 
provided herein. TAOJIK modulating agents are useful in diagnosis, therapy and 
pharmaceutical development. 

Nucleic acids and polypeptides of the invention 

25 Sequences related to TAOJIK nucleic acids and polypeptides that can be used in 

the invention are disclosed in Genbank (referenced by Genbank identifier (GI) number) as 
GI#s 4759207 (SEQ ID NO:l), 15929548 (SEQ ID NO:2), 7706400 (SEQ ID NO:4), 

7705559 (SEQ ID NO:5), 20552662 (SEQ ID NO:6), 12803830 (SEQ ID NO:7), 
19923463 (SEQ ID NO:8), 15302531 (SEQ ID NO:9), 7243102 (SEQ ID NO: 10), 

30 18587661 (SEQ ID NO:l 1), 20559660 (SEQ ID NO: 12), 1 1596143 (SEQ ID NO: 13), and 
12803830 (SEQ ID NO: 15) for nucleic acid, and GI#s 4759208 (SEQ ID NO:16), 

7705560 (SEQ ID NO: 17), and 7243 103 (SEQ ID NO: 1 8) for polypeptides. Additionally, 
nucleic acid sequences of SEQ ID NOs:3 and 14 can also be used in the invention. 

5 
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TAOJIKs are kinase proteins with kinase domains. The term "TAOJIK 
polypeptide" refers to a full-length TAOJIK protein or a functionally active fragment or 
derivative thereof. A "functionally active" TAOJIK fragment or derivative exhibits one or 
more functional activities associated with a full-length, wild-type TAOJIK protein, such as 
5 antigenic or immunogenic activity, enzymatic activity, ability to bind natural cellular 
• . substrates, etc. The functional activity of TAOJIK proteins, derivatives and fragments can 
be assayed by various methods known to one skilled in the art (Current Protocols in 
Protein Science (1998) Coligan et al. 9 eds., John Wiley & Sons, Inc., Somerset, New 
* Jersey) and as further discussed below. In one embodiment* a functionally active TAOJIK 

10 polypeptide is a TAOJIK derivative capable of rescuing defective endogenous TAOJIK 
activity, such as in cell based or animal assays; the rescuing derivative may be from the 
same or a different species. For purposes herein, functionally active fragments also 
include those fragments that comprise one or more structural domains of a TAOJIK, such 
as a kinase domain or a binding domain. Protein domains can be identified using the 

15 PFAM program (Bateman A., et al., Nucleic Acids Res, 1999, 27:260-2). For example, 

the kinase (PFAM 00069) domain of TAOJIK from GI#s 4759208, 7705560, and 7243103 
(SEQ ID NOs: 16, 17, and 18, respectively) are located at approximately amino acid 
residues 28 to 281, 24 to 277, and 32 to 285, respectively. Methods for obtaining TAOJIK 
polypeptides are also further described below, In some embodiments, preferred fragments 

20 are functionally active, domain-containing fragments comprising at least, 25 contiguous 
amino acids, preferably at least 50, more preferably 75, and most preferably at least 100 
contiguous amino acids of any one of SEQ ID NOs: 16-1 8 (a TAOJIK). In further 
preferred embodiments, the fragment comprises the entire kinase (functionally active) 
domain. 

25 The term "TAOJIK nucleic acid" refers to a DNA or RNA molecule that encodes a 

TAOJIK polypeptide. Preferably, the TAOJIK polypeptide or nucleic acid or fragment 
thereof is from a human, but can also be an prtholog, or derivative thereof with at least 
70% sequence identity, preferably at least 80%, more preferably 85%, still more 
preferably 90%, and most preferably at least 95% sequence identity with human TAOJIK. 
30 Methods of identifying orthlogs are known in the art. Normally, orthologs in different 
species retain the same function, due to presence of one or more protein motifs and/or 3- 
dimensional structures. Orthologs are generally identified by sequence homology 
analysis, such as BLAST analysis, usually using protein bait sequences. Sequences are 
assigned as a potential ortholog if the best hit sequence from the forward BLAST result 

6 
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retrieves the original query sequence in the reverse BLAST (Huynen MA and Bork P, 
Proc Natl Acad Sci (1998) 95:5849-5856; Huynen MA et ah, Genome Research (2000) 
10:1204-1210). Programs for multiple sequence alignment, such as CLUSTAL 
(Thompson ID et al, 1994, Nucleic Acids Res 22:4673-4680) may be used to highlight 
5 conserved regions and/or residues of orthologous proteins and to generate phylogenetic 
trees. In a phylogenetic tree representing multiple homologous sequences from diverse 
species (e.g., retrieved through BLAST analysis), orthologous sequences from two species 
generally appear closest on the tree with respect to all other sequences from these two 
species. Structural threading or other analysis of protein folding (e.g., using software by 

10 ProCerybn, Biosciences, Salzburg, Austria) may also identify potential orthologs. In 

evolution, when a gene duplication event follows speciation, a single gene in one species, 
such as C. elegans y may correspond to multiple genes (paralogs) in another, such as 
human. As used herein, the term "orthologs" encompasses paralogs. As used herein, 
"percent (%) sequence identity" with respect to a subject sequence, or a specified portion 

15 of a subject sequence, is defined as the percentage of nucleotides or amino acids in the 

candidate derivative sequence identical with the nucleotides or amino acids in the subject 
sequence (or specified portion thereof), after aligning the sequences and introducing gaps, 
if necessary to achieve the maximum percent sequence identity, as generated by the 
program WU-BLAST-2.0al9 (Altschul et a/., J. Mol. Biol. (1997) 215:403-410) with all 

20 the search parameters set to default values. The HSP S and HSP S2 parameters arc 

dynamic values and are established by the program itself depending upon the composition 
of the particular sequence and composition of the particular database against which the 
sequence of interest is being searched. A % identity value is determined by the number of 
matching identical nucleotides or amino acids divided by the sequence length for which 

25 the percent identity is being reported. "Percent (%) amino acid sequence similarity" is 
determined by doing the same calculation as for determining % amino acid sequence 
identity, but including conservative amino acid substitutions in addition to identical amino 
acids in the computation. 

A conservative amino acid substitution is one in which an amino acid is substituted 

30 for another amino acid having similar properties such that the folding or activity of the 

protein is not significantly affected. Aromatic amino acids that can be substituted for each 
other are phenylalanine, tryptophan, and tyrosine; interchangeable hydrophobic amino 
acids are leucine, isoleucine, methionine, and valine; interchangeable polar amino acids 
are glutamine and asparagine; interchangeable basic amino acids are arginine, lysine and 



WO 03/051905 



PCT/US02/39742 



histidine; interchangeable acidic amino acids are aspartic acid and glutamic acid; and 
interchangeable small amino acids are alanine, serine, threonine, cysteine and glycine. 

Alternatively, an alignment for nucleic acid sequences is provided by the local 
homology algorithm of Smith and Waterman (Smith and Waterman, 1981, Advances in 
5 Applied Mathematics 2:482-489; database: European Bioinformatics Institute; Smith and 
Waterman, 1981, J. of Molec.Biol., 147:195-197; Nicholas et al„ 1998, "A Tutorial on 

ic— " — 

u" - . 

Searching Sequence Databases and Sequence Scoring Methods" (wv/w.psc.edu) and 
. v< references cited therein.; W.R. Pearson, 1991, Genomics 11:635-650). This algorithm can 
be applied to amino acid sequences by using the scoring matrix developed by Dayhoff 

10 (Dayhoff: Atlas of Protein Sequences and Structure, M. O. Dayhoff ed., 5 suppl. 3:353- 

358, National Biomedical Research Foundation, Washington, D.C., USA), and normalized 
; by Gribskov (CJribskoy 1986 Nucl. Acids Res. 14(6):6745-6763). The Smith-Waterman 
algorithm may be employed where default parameters are used for scoring (for example, 
gap open penalty of 1 2, gap extension penalty of two) . From -the data generated, the 

15 "Match" value reflects "sequence identity." 

Derivative nucleic acid molecules of the subject nucleic acid molecules include 
sequences that hybridize to.the nucleic acid sequence of any -of SEQ H) NOs: l-15. The 
stringency of hybridization can be controlled by temperature, ionic strength, pH, and the 
presence of denaturing agents , such as formamide during hybridization and washing. 

20 Conditions routinely used are set out in readily available procedure texts (e.g., Current 

Protocol in Molecular Biology, Vol. 1, Chap. 2.10, John Wiley & Sons, Publishers (1994); 
Sambrook et al., Molecular Cloning, Cold Spring Harbor (1989)). In some embodiments, 
a nucleic acid molecule of the invention is capiable of hybridizing to a nucleic acid 
molecule containing the nucleotide sequence of any one of SEQ ID NOs: 1-15 under high 

25 stringency hybridization conditions that are: prehybridizatipn pf filters containing nucleic 
acid for 8 hours to overnight at 65° C in a solution comprising 6X single strength citrate 
(SSC) (1X SSC is 0.15 M NaCl, 0.015 M Na citrate; pH 7,0), 5X Denhardt f s solution, 
0i05% sodium pyrophosphate and 100 Mg/ml herring sperm DNA; hybridization for 18-20 
hours at 65° C in a solution containing 6X SSC, IX Denhardt's solution, 100 /ig/ml yeast 

30 ; tRNA and 0.05% sodium pyrophosphate; and washing of filters at 65° C for lh in a 
solution containing 0. IX SSC ^ and 0.1% SDS (sodium dodecyl sulfate). 

In other embodiments, moderately stringent hybridization conditions are used that 
are: pretreatment of filters containing nucleic acid for 6 h at 40° C in a solution containing 
35% formamide, 5X SSC, 50 mM Tris-HCl (pH7,5), 5mM EDTA, 0.1% PVP, 0.1% 

8 
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Ficoll, 1% BSA, and 500 fig/ml denatured salmon sperm DNA; hybridization for 18-20h 
at 40° C in a solution containing 35% formamide, 5X SSC, 50 mM Tris-HCl (pH7.5), 
5mM EDTA, 0.02% PVP, 0.02% Ficoll, 0.2% BSA, 100 fig/mi salmon sperm DNA, and 
10% Cwt/vol) dextran sulfate; followed by washing twice for 1 hour at 55° C in a solution 
5 containing 2XSSC and 0.1% SDS. 

Alternatively, low stringency conditions can be used that are: incubation for 8 
hours to overnight at 37° C in a solution comprising 20% formamide, 5 x SSC, 50 mM 
sodium phosphate (pH 7.6), 5X Denhardt's solution, 10% dextran sulfate, and 20 /zg/ml 
denatured sheared salmon sperm DNA; hybridization in the same buffer for 18 to 20 
10 hours; and washing of filters in 1 x SSC at about 37° C for 1 hour. 

Isolation, Production, Expression, and Mis-expression of TAOJIK Nucleic Acids and 
Polypeptides 

TAOJIK nucleic acids and polypeptides, useful for identifying and testing agents 

15 that modulate TAOJIK function and for other applications related to the involvement of 
TAOJIK in the beta-catenin pathway. TAOJIK nucleic acids and derivatives and 
orthologs thereof may be obtained using any available method. For instance, techniques 
for isolating cDNA or genomic DNA sequences of interest by screening DNA libraries or 
by using polymerase chain reaction (PCR) are well known in the art. In general, the 

20 particular use for the protein will dictate the particulars of expression, production, and 
purification methods. For instance, production of proteins for use in screening for 
modulating agents may require methods that preserve specific biological activities of these 
proteins, whereas production of proteins for antibody generation may require structural 
integrity of particular epitopes. Expression of proteins to be purified for screening or 

25 antibody production may require the addition of specific tags (e.g., generation of fusion 
proteins). Overexpression of a TAOJIK protein for assays used to assess TAOJIK 
function, such as involvement in cell cycle regulation or hypoxic response, may require 
expression in eukaryotic cell lines capable of these cellular activities. Techniques for the 
expression, production, and purification of proteins are well known in the art; any suitable 

30 means therefore may be used (e.g., Higgins SJ and Hames BD (eds.) Protein Expression: 
A Practical Approach, Oxford University Press Inc., New York 1999; Stanbury PF et aL, 
Principles of Fermentation Technology, 2 nd edition, Elsevier Science, New York, 1995; 
Doonan S (ed.) Protein Purification Protocols, Humana Press, New Jersey, 1996; Coligan 
JE et al, Current Protocols in Protein Science (eds.), 1999, John Wiley & Sons, New 
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York). In particular embodiments, recombinant TAOJIK is expressed in a cell line known 
to have defective beta-catenin function. The recombinant cells are used in cell-based 
screening assay systems of the invention, as described further below. 

The nucleotide sequence encoding a TAOJIK polypeptide can be inserted into any 
5 appropriate expression vector. The necessary transcriptional and translational signals, 
including promoter/enhancer element, can derive from the native TAOJIK gene and/or its 
flanking regions or can be heterologous. A variety of host- vector expression systems may 
be utilized, such as mammalian cell systems infected with virus {e.g. vaccinia virus, 

"V: 4- 

adenovirus, etc.); insect cell systems infected with virus (e.g. baculo virus); 

10 microorganisms such as yeast containing yeast vectors, or bacteria transformed with 

bacteriophage, plasmid, or cosmid DNA. An isolated host cell strain that modulates the 
expression of, modifies, and/or specifically processes the gene product may be used. 

To detect expression of the TAOJIK gene product, the expression vector can 
comprise a promoter operably linked to a TAOJIK gene nucleic acid, one or more origins 

15 of replication, and, one or more selectable markers (e.g. thymidine kinase activity, 
resistance to antibiotics, etc.). Alternatively, recombinant expression vectors can be 
identified by assaying for the expression of the TAOJIK gene product based on the 
physical or functional properties of the TAOJIK protein in in vitro assay systems (e.g. 
immunoassays). 

20 The TAOJIK protein, fragment, or derivative may be optionally expressed as a 

fusion, or chimeric protein product (i.e. it is joined via a peptide bond to a heterologous 
protein sequence of a different protein), for example to facilitate purification or detection. 
A chimeric product can be made by ligating the appropriate nucleic acid sequences 
encoding the desired amino acid sequences to each other using standard methods and 

25. expressing the chimeric product. A chimeric product may also be made by protein 

synthetic techniques, e.g. by use of a {peptide synthesizer (Hunkapiller et oi., Nature (1984) 
310:105-111). 

Once a recombinant cell that expresses the TAOJIK gene sequence is identified, 
the gene product can be isolated and purified using standard methods (e.g. ion exchange, 
30 affinity, and gel exclusion chromatography; centrifugation; differential solubility; 

electrophoresis). Alternatively, native TAOJIK proteins can be purified from natural 
sources, by standard methods (e.g. immunoaffinity purification). Once a protein is 
obtained, it may be quantified and its activity measured by appropriate methods, such as 
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immunoassay, bioassay, or other measurements of physical properties, such as 
crystallography. 

The methods of this invention may also use cells that have been engineered for 
altered expression (mis-expression) of TAOJIK or other genes associated with the beta- 
5 catenin pathway. As used herein, mis-expression encompasses ectopic expression, over- 
expression, under-expression, and non-expression (e.g. by gene knock-out or blocking 
expression that would otherwise normally occur). 

Genetically modified animals 

10 Animal models that have been genetically modified to alter TAOJIK expression 

may be used in in vivo assays to test for activity of a candidate beta-catenin modulating 
agent, or to further assess the role of TAOJIK in a beta-catenin pathway process such as 
apoptosis or cell proliferation. Preferably, the altered TAOJIK expression results in a 
detectable phenotype, such as decreased or increased levels of cell proliferation, 

15 angiogenesis, or apoptosis compared to control animals having normal TAOJIK 

expression. The genetically modified animal may additionally have altered beta-catenin 
expression (e.g. beta-catenin knockout). Preferred genetically modified animals are 
mammals such as primates, rodents (preferably mice or rats), among others. Preferred 
non-mammalian species include zebrafish, C elegans, and Drosophila. Preferred 

20 genetically modified animals are transgenic animals having a heterologous nucleic acid 
sequence present as an extrachromosomai element in a portion of its cells, i.e. mosaic 
animals (see, for example, techniques described by Jakobovits, 1994, Curr. Biol. 4:761- 
763.) or stably integrated into its germ line DNA (i.e., in the genomic sequence of most or 
all of its cells). Heterologous nucleic acid is introduced into the germ line of such 

25 transgenic animals by genetic manipulation of, for example, embryos or embryonic stem 
cells of the host animal. 

Methods of making transgenic animals are well-known in the art (for transgenic 
mice see Brinster et aL, Proc. Nat Acad. Sci. USA 82: 4438-4442 (1985), U.S. Pat. Nos. 
4,736,866 and 4,870,009, both by Leder et aL, U.S. Pat No. 4,873,191 by Wagner et al„ 

30 and Hogan, B., Manipulating the Mouse Embryo, Cold Spring Harbor Laboratory Press, 
Cold Spring Harbor, N.Y., (1986); for particle bombardment see U.S. Pat No., 4,945,050, 
by Sandford et al.\ for transgenic Drosophila see Rubin and Spradling, Science (1982) 
218:348-53 and U.S. Pat. No. 4,670,388; for transgenic insects see Berghammer AJ. et 
al., A Universal Marker for Transgenic Insects (1999) Nature 402:370-371; for transgenic 
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Zebrafish see Lin S., Transgenic Zebrafish, Methods Mol Biol. (2000); 136:375-3830); for 
microinjection procedures for fish, amphibian eggs and birds see Houdebine and 
Chourrout, Experientia (1991) 47:897-905; for transgenic rats see Hammer et aL, Cell 
(1990) 63:1099-1112; and for culturing of embryonic stem (ES) cells and the subsequent 
5 production of transgenic animals by the introduction of DNA into ES cells using methods 
such as electroporatioh, calcium phosphate/DNA precipitation and direct injection see, 
e.g., Teratocarcinomas and Embryonic Stem Cells, A Practical Approach, E. J. Robertson, 
ed., IRL Press (1987)). Clones of the nonhuman transgenic animals can be produced 
according to available methods (see Wilmut, I. et ol. (1997) Nature 385:810-813; and PCT 
10 International Publication Nos. WO 97/07668 and WO 97/07669). 

In one embodiment,^ the transgenic animal is a "knock-out" animal having a 
heterozygous or homozygous alteration in the sequence of an endogenous TAOJIK gene 
that results in a decrease of TAOJIK function, preferably such that TAOJIK expression is 
undetectable or insignificant. Knock-out animals are typically generated by homologous 
15 recombination with a vector comprising a trarisgene having at least a portion of the gene to 
be knocked out. Typically a deletion, addition or substitution has been introduced into the 
transgene td functionally disrupt it. The transgetie can be a human gene (e.g., from a 
human genomic clone) but more preferably is an ortholog of the human gene derived from 
the transgenic host species. For example, a mouse TAOJIK gene is used to construct a 
20 homologous recombination vector suitable for altering an endogenous TAOJIK gene in the 
mouse genome. Detailed methodologies for homologous recombination in mice are 
available (see Capecchi, Science (1989) 244: 1288-1292; Joyner et ah, Nature (1989) 
338:153-156). Procedures for the production of non-rodent transgenic mammals and other 
animals are also available (Houdebine and Chourrout, supra; Pursel et ah. Science (1989) 
25 244:1281-1288; Simms et aL, Bio/Technology (1988) 6:179-183). In a preferred 

embodiment, knock-out animals, such as mice harboring a knockout of a specific gene, 
may be used to produce antibodies against the human counterpart of the gene that has been 
knocked out (Claesson MH et al M (1994) Scan J Immunol 40:257-264; Declerck PJ et 
al., (1995) J Biol Chem; 270:8397-400). : 
30 In another embodiment, the transgenic animal is a "knock-in" animal having an 

alteration in its genome that results in altered expression (e.g., increased (including 
ectopic) or decreased expression) of the TAOJIK gene, e.g., by introduction of additional 
copies of TAOJIK, or by operati vely inserting a regulatory sequence that provides for 
altered expression of an endogenous copy of the TAOJIK gene. Such regulatory 
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sequences include inducible, tissue-specific, and constitutive promoters and enhancer 
elements. The knock-in can be homozygous or heterozygous. 

Transgenic nonhuman animals can also be produced that contain selected systems 
allowing for regulated expression of the transgene. One example of such a system that 
5 may be produced is the cre/loxP recombinase system of bacteriophage PI (Lakso et al., 
PNAS (1992) 89:6232-6236; U.S. Pat. No. 4,959,317). If a cre/loxP recombinase system 
is used to regulate expression of the transgene, animals containing transgenes encoding 
both the Cre recombinase and a selected protein are required. Such animals can be 
provided through the construction of "double" transgenic animals, e.g., by mating two * 
10 transgenic animals, one containing a transgene encoding a selected protein and the other 
containing a transgene encoding a recombinase. Another example of a recombinase 
system is the FLP recombinase system of Saccharomyces cerevisiae (O'Gorman et al. 
(1991) Science 251:1351-1355; U.S. Pat. No. 5,654,182). In a preferred embodiment, 
both Gre-LoxP and Flp-Frt are used in the same system to regulate expression of the 
15 transgene, and for sequential deletion of vector sequences in the same cell (Sun X et al 
(2000) Nat Genet 25:83-6). 

The genetically modified animals can be used in genetic studies to further elucidate 
the beta-catenin pathway, as animal models of disease and disorders implicating defective 
beta-catenin function, and for in vivo testing of candidate therapeutic agents, such as those 
20 identified in screens described below. The candidate therapeutic agents are administered 
to a genetically modified animal having altered TAOJTK function and phenotypic changes 
are compared with appropriate control animals such as genetically modified animals that 
receive placebo treatment, and/or animals with unaltered TAOJIK expression that receive 
candidate therapeutic agent. 
25 In addition to the above-described genetically modified animals having altered 

TAOJIK function, animal models having defective beta-catenin function (and otherwise 
normal TAOJIK function), can be used in the methods of the present invention. For 
example, a beta-catenin knockout mouse can be used to assess, in vivo, the activity of a 
candidate beta-catenin modulating agent identified in one of the in vitro assays described 
30 below. Preferably, the candidate beta-catenin modulating agent when administered to a 
model system with cells defective in beta-catenin function, produces a detectable 
phenotypic change in the model system indicating that the beta-catenin function is 
restored, i.e., the cells exhibit normal cell cycle progression. 

13 
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Modulating Agents 

The invention provides methods to identify agents that interact with and/or 
modulate the function of TAOJIK and/or the beta-catenin pathway. Modulating agents 
identified by the methods are also part of the invention. Such agents are useful in a variety 
5 of diagnostic and therapeutic applications associated with the beta-catenin pathway, as 
weir as in further analysis of the TAOJIK protein and its contribution to the beta-catenin 
pathway. Accordingly, the invention also provides methods for modulating the beta- 
< : 4 catenih pathway comprising the step of specifically modulating TAOJIK activity by 

administering a TAO JIK-interacting or -modulating agent. 
10 As used herein, a 'TAOJIK-modulating agent" is any agent that modulates 

? TAOJIK function* for example, an agent that interacts withTAOJIK to inhibit or enhance 
TAOJIK activity or otherwise affect normal TAOJIK function. TAOJIK function can be 
t; affected at any level, including transcription, protein expression, protein localization, and 

cellular or extracellular activity. In a preferred embodiment the TAOJIK - modulating 
1 5 agent specifically modulates the function of the TAOJIK. The phrases "specific 

modulating agent", "specifically modulates", etc., are used herein to refer to modulating 
* agents that directly bind to the TAOJIK polypeptide or nucleic acid, and preferably inhibit, 
enhance, or otherwise alter, the function of the TAOJIK. These phrases also encompasses 
/ modulating; agents that alter the interaction of the TAOJIK with a binding partner, 
20 substrate, or cofactor (e.g. by binding to a binding partner of a TAOJIK, or to a 

protein/binding partner complex, and altering TAOJIK function). In a further preferred 
embodiment, the TAOJIK- modulating agent is a modulator of the beta-catenin pathway 
(e.g. it restores and/or upregulates beta-catenin function) and thus is also a beta-catenin- 
modulating agent. 

25 Preferred TAOJIK-modulating agents include small molecule compounds; 

TAOJIK-interacting proteins, including antibodies and other biotherapeu tics; and nucleic 
acid modulators such as antisense and RNA inhibitors. The modulating agents may be 
formulated in pharmaceutical compositions, for example, as compositions that may 
v comprise other active ingredients, as in combination therapy, and/or suitable carriers or 

30 excipients. Techniques for formulation and administration of the compounds may be 

found in "Remington's Pharmaceutical Sciences" Mack Publishing Co.j Easton, PA, 19 th 
edition. 
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Small molecule modulators 

Small molecules are often preferred to modulate function of proteins with 
enzymatic function, and/or containing protein interaction domains. Chemical agents, 
referred to in the art as "small molecule" compounds are typically organic, non-peptide 
5 molecules, having a molecular weight less than 10,000, preferably less than 5,000, more 
preferably less than 1,000, and most preferably less than 500. This class of modulators 
includes chemically synthesized molecules, for instance, compounds from combinatorial 
chemical libraries. Synthetic compounds may be rationally designed or identified based 
on known or inferred properties of the TAO JIK protein or may be identified by screening 

10 compound libraries. Alternative appropriate modulators of this class are natural products, 
particularly secondary metabolites from organisms such as plants or fungi, which can also 
be identified by screening compound libraries for TAOJIK-modulating activity. Methods 
for generating and obtaining compounds are well known in the art (Schreiber SL, Science 
(2000) 151: 1964-1969; Radmann J and Gunther J, Science (2000) 151:1947-1948). 

15 Small molecule modulators identified from screening assays, as described below, 

can be used as lead compounds from which candidate clinical compounds may be 
designed, optimized, and synthesized. Such clinical compounds may have utility in 
treating pathologies associated with the beta-catenin pathway. The activity of candidate 
small molecule modulating agents may be improved several-fold through iterative 

20 secondary functional validation, as further described below, structure determination, and 
candidate modulator modification and testing. Additionally^ candidate clinical compounds 
are generated with specific regard to clinical and pharmacological properties. For example, 
the reagents may be derivatized and re-screened using in vitro and in vivo assays to 
optimize activity and minimize toxicity for pharmaceutical development. 

25 

Protein Modulators 

Specific TAOJIK-interacting proteins are useful in a variety of diagnostic and 
therapeutic applications related to the beta-catenin pathway and related disorders, as well 
as in validation assays for other TAO JIK-modulating agents. In a preferred embodiment, 
30 TAOJIK-interacting proteins affect normal TAOJIK function, including transcription, 

protein expression, protein localization, and cellular or extra-cellular activity. In another 
embodiment, TAOJIK-interacting proteins are useful in detecting and providing 
information about the function of TAOJIK proteins, as is relevant to beta-catenin related 
disorders, such as cancer (e.g., for diagnostic means). 

15 
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An TAOJIK-interacting protein may be endogenous, i.e. one that naturally 
interacts genetically or biochemically with a TAOJIK, such as a member of the TAOJTK 
pathway that modulates TAOJIK expression, localization, and/or activity. TAOJIK- 
modulators include dominant negative forms of TAOJIK-interacting proteins and of 
5 TAOJIK proteins themselves. Yeast two-hybrid and variant screens offer preferred 
v . methods for identifying endogenous TAOJIK-interacting proteins (Finley, R. L. et al. 

(1996) in DNA Cloning-Expression Systems: A Practical Approach, eds. Glover D. & 
;K Hames B. D (Oxford University Press, Oxford, England), pp. 169-203; Fashema SF et al., 
Gene (2000) 250: 1-14; Drees BL Gurr Opin Ghem Biol (1999) 3:64^70; Vidal M and 
10 Legrain P Nucleic Acids Res (1999) 27:919-29; and U.S. Pat. No. 5,928,868). Mass 
, spectrometry is an alternative preferred method for the elucidation of protein complexes 
_< (reviewed in, e.g., Pandley A and Mann M, Nature (2000) 405:837-846; Yates JR 3 rd , 
fe Trends Genet (2000) 16:5-8). 

An TAOJIK-interacting protein may be an exogenous-protein, such as a TAOJIK- 
15 specific antibody or a T-cell antigen receptor (see, e.g., Harlow and Lane (1988) 

Antibodies, A Laboratory Manual, Gold Spring Harbor Laboratory; Harlow and Lane 
(1999) Using antibodies: a laboratory manual. Cold Spring Harbor, NY: Cold Spring 
Harbor Laboratory Press). TAOJIK antibodies are further discussed below. 

In preferred embodiments, a TAOJIK-interacting protein specifically binds a 
20 TAOJIK protein. In alternative preferred embodiments, a TAO JIK-modulating agent 
binds a TAOJIK substrate, binding partner, or cofactor. 

Antibodies 

In another embodiment, the protein modulator is a TAOJIK specific antibody 
25 agonist or antagonist. The antibodies have therapeutic and diagnostic utilities, and can be 
used in screening assays to identify TAOJIK modulators. The antibodies can also be used 
; , in dissecting the portions of the TAOJIK pathway responsible for various cellular 
responses and in the general processing and maturation of the TAOJIK. V 

Antibodies that specifically bind TAOJIK polypeptides can be generated using 
30 ■ known methods. Preferably the antibody is specific to a mammalian ortholog of TAOJIK 
, polypeptide, and more preferably, to human TAOJIK. Antibodies may be polyclonal, 
. monoclonal (mAbs), humanized or chimeric antibodies, single chain antibodies, Fab 
fragments, F(ab').sub.2 fragments, fragments produced by a FAb expression library, anti- 
idiotype (anti-Id) antibodies, and epitope-binding fragments of any of the above. 
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Epitopes of TAOJIK which are particularly antigenic can be selected, for example, by 
routine screening of TAOJIK polypeptides for antigenicity or by applying a theoretical 
method for selecting antigenic regions of a protein (Hopp and Wood (1981), Proc. Nati. 
Acad. Sci. U.S.A. 78:3824-28; Hopp and Wood, (1983) Mol. Immunol. 20:483-89; 
5 Sutcliffe et aL, (1983) Science 219:660-66) to the amino acid sequence shown in any of 
SEQ ID NOs:16-18. Monoclonal antibodies with affinities of 10 8 M" 1 preferably 10 9 M~ l 
to 10 10 NT 1 , or stronger can be made by standard procedures as described (Harlow and 
Lane, supra; Goding (1986) Monoclonal Antibodies: Principles and Practice (2d ed) 
Academic Press, New York; and U.S. Pat. Nos. 4,381,292; 4,451,570; and 4,618,577). 
10 Antibodies may be generated against crude cell extracts of TAOJIK or substantially 

purified fragments thereof. If TAOJIK fragments are used, they preferably comprise at 
least 10, and more preferably, at least 20 contiguous amino acids of a TAOJIK protein. In 
a particular embodiment, TAOJIK-specific antigens and/or immunogens are coupled to 
carrier proteins that stimulate the immune response. For example, the subject 
15 polypeptides are covalently coupled to the keyhole limpet hemocyanin (KLH) carrier, and 
the conjugate is emulsified in Freund's complete adjuvant, which enhances the immune 
response. An appropriate immune system such as a laboratory rabbit or mouse is 
immunized according to conventional protocols. 

The presence of TAOJIK-specific antibodies is assayed by an appropriate assay 
20 such as a solid phase enzyme-linked immunosorbant assay (ELISA) using immobilized 
corresponding TAOJIK polypeptides. Other assays, such as radioimmunoassays or 
fluorescent assays might also be used. 

Chimeric antibodies specific to TAOJIK polypeptides can be made that contain 
different portions from different animal species. For instance, a human immunoglobulin 
25 constant region may be linked to a variable region of a murine mAb, such that the 
antibody derives its biological activity from the human antibody, and its binding 
specificity from the murine fragment. Chimeric antibodies are produced by splicing 
together genes that encode the appropriate regions from each species (Morrison et aL, 
Proc. Natl. Acad. Sci. (1984) 81:6851-6855; Neuberger et aL, Nature (1984) 312:604-608; 
30 Takeda et aL, Nature (1985) 31:452-454). Humanized antibodies, which areaform of 
chimeric antibodies, can be generated by grafting complementary-determining regions 
(CDRs) (Carlos, T. M., J. M. Harlan. 1994. Blood 84:2068-2101) of mouse antibodies 
into a background of human framework regions and constant regions by recombinant 
DNA technology (Riechmann LM, et aL, 1988 Nature 323: 323-327). Humanized 
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antibodies contain -10% murine sequences and -90% human sequences, and thus further 
reduce or eliminate immunogenicity, while retaining the antibody specificities (Co MS, 
and Queen C. 1991 Nature 351: 501-501; Morrison SL. 1992 Ann. Rev. Immun. 
10:239-265). Humanized antibodies and methods of their production are well-known in 
5 the art (U.S. Pat. Nos. 5,530,101, 5,585,089, 5,693,762, and 6,180,370). 

TAOJIK-specific single chain antibodies which are recombinant, single chain 
polypeptides formed by linking the heavy and light chain fragments of the Fv regions via 
an amino acid bridge, can be produced by methods known in the art (U.S. Pat. No. 
4,946,778; Bird, Science (1988) 242:423-426; Huston et al., Proc. Natl. Acad. Sci. USA 
10 (1988) 85:5879-5883; and Ward et al., Nature (1989) 334:544-546). 

Other suitable techniques for antibody production involve in vitro exposure of 
lymphocytes to the antigenic polypeptides or alternatively to selection of libraries of 
antibodies in phage or similar, vectors (Huse etal., Science (1989) 246:1275-1281). As 
used herein, T-cell antigen receptors are included within the scope of antibody modulators 
15 (Harlow and Lane, 1988, supra). 

The polypeptides and antibodies of the present invention may.be used with or 
without modification. Frequendy, antibodies will be labeled by joining, either covalently 
or non-covalently, a substance that provides for a detectable signal, or that is toxic to cells 
that express the targeted protein (Menard S, et al., Int J. Biol Markers (1989) 4:131-134). 
20 A wide variety of labels and conjugation techniques are laiovvn and are reported 
extensively in both the sc^ntific and patent literature. Suitable labels include 
radionuclides, enzymes, substrates, cefaclors, inhibitors, fluorescent moieties, fluorescent 
emitting lanthanide metals, chemiluminescent moieties, bioluminescent moieties, 
magnetic particles, and the like (U.S. Pat. Nos. 3,817,837; 3,850,752; 3,939,350; 
25 3,996,345; 4,277,437; 4,275,149; and 4,366,241). Also, recombinant immunoglobulins 
may be produced (U.S. Pat No. 4,816,567). Antibodies to cytoplasmic polypeptides may 
be delivered and reach their targets by conjugation with membranerpenetrating toxin 
proteins (U.S. Pat. No. 6,086,900). 

When used therapeutically in a patient, the antibodies of the subject invention are 
30 typically administered parenterally, when possible at the target site, or intravenously. The 
therapeutically effective dose and dosage regimen is determined by clinical studies. 
Typically, the amount of antibody administered is in the range of about 0.1 mg/kg -to 
about 10 mg/kg of patient weight. For parenteral administration, the antibodies are 
formulated in a unit dosage injectable form (e.g., solution, suspension, emulsion) in 
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association with a pharmaceutical^ acceptable vehicle. Such vehicles are inherently 
nontoxic and non-therapeutic. Examples are water, saline, Ringer's solution, dextrose 
solution, and 5% human serum albumin. Nonaqueous vehicles such as fixed oils, ethyl 
oleate, or liposome carriers may also be used. The vehicle may contain minor amounts of 
5 additives, such as buffers and preservatives, which enhance isotonicity and chemical 
stability or otherwise enhance therapeutic potential. The antibodies' concentrations in 
such vehicles are typically in the range of about 1 mg/ml to aboutlO mg/ml. 
Immunotherapeutic methods are further described in the literature (US Pat. No. 5,859,206; 
WO(fo73469). 

10 

Nucleic Acid Modulators 

Other preferred TAOJIK-modulating agents comprise nucleic acid molecules, such 
as antisense oligomers or double stranded RNA (dsRNA), which generally inhibit 
TAOJIK activity. Preferred nucleic acid modulators interfere with the function of the 

15 TAOJIK nucleic acid such as DNA replication, transcription, translocation of the TAOJIK 
RNA to the site of protein translation, translation of protein from the TAOJIK RNA, 
splicing of the TAOJIK RNA to yield one or more mRNA species, or catalytic activity 
which may be engaged in or facilitated by the TAOJIK RNA. 

In one embodiment, the antisense oligomer is an oligonucleotide that is sufficiently 

20 complementary to a TAOJIK mRNA to bind to and prevent translation, preferably by 
binding to the 5' untranslated region. TAOJIK-specific antisense oligonucleotides, 
preferably range from at least 6 to about 200 nucleotides. In some embodiments the 
oligonucleotide is preferably at least 10, 15, or 20 nucleotides in length. In other 
embodiments, the oligonucleotide is preferably less than 50, 40, or 30 nucleotides in 

25 length. The oligonucleotide can be DNA or RNA or a chimeric mixture or derivatives or 
modified versions thereof, single-stranded or double-stranded. The oligonucleotide can be 
modified at the base moiety, sugar moiety, or phosphate backbone. The oligonucleotide 
may include other appending groups such as peptides, agents that facilitate transport 
across the cell membrane, hybridization-triggered cleavage agents, and intercalating 

30 agents. 

In another embodiment, the antisense oligomer is a phosphothioate morpholino 
oligomer (PMO). PMOs are assembled from four different morpholino subunits, each of 
which contain one of four genetic bases (A, C, G, or T) linked to a six-membered 
morpholine ring. Polymers of these subunits are joined by non-ionic phosphodiamidate 
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intersubunit linkages. Details of how to make and use PMOs and other antisense 
oligomers are well known in the art (e.g. see W099/18 193; Probst JC, Antisense 
Oligodeoxynucleotide and Ribozyme Design, Methods. (2000) 22(3):271-281; Summerton 
J, and Weller D. 1997 Antisense Nucleic Acid Drug Dev. :7:187-95; US Pat. No. 
5 5,235,033; and US Pat No. 5,378,841). 

Alternative preferred TAOJIK nucleic acid modulators are double-stranded RNA 
species mediating RNA interference: (RNAi). RNAi is the process of sequence-specific, 
post-transcriptional gene silencing in animals and plants, initiated by double-stranded 
RNA (dsRNA) that is homologous in sequence to the silenced gene. Methods relating to 

10 the use of RNAi to silence genes in C. elegans, Drosophila, plants, and humans are known 
in the art (Fire A, etaL, 1998 Nature 391:806-811; Fire, A. Trends Genet 15, 358-363 
(1999); Sharp, P. A. RNA interference 2001. Genes Dev. 15, 485-490 (2001); Hammond, 
S. M., et al,, Nature Rev. Genet. 2, 110*1119 (2001); Tuschl, T. Chem. Biochem. 2, 239- 
245 (2001); Hamilton, A. et al., Science 286, 950-952 (1999); Hammond, S. M., et al., 

15 Nature 404, 293-296 (2000); Zamore, P. D., et al., Cell 101, 25-33 (2000); Bernstein, E., 
et al., Nature 409, 363-366 (2001); Elbashir, S. M., et al., Genes Dev. 15, 188-200 
(2001); WO0129058; WO9932619; Elbashir SM, et al,, 2001 Nature 411:494-498). 

Nucleic acid modulators are commonly used as research reagents, diagnostics, and 
therapeutics. For example, antisense oligonucleotides, which are able to inhibit gene 

20 expression with exquisite specificity, are often used to elucidate the function of particular 
genes (see, for example, U.S. Pat. No. 6,165,790). Nucleic acid modulators are also used, 
for example, to distinguish between functions of various members of a biological pathway. 
For example, antisense oligomers have been employed as therapeutic moieties in the 
treatment of disease states in animals and man and have been. demonstrated in numerous 

25 clinical trials to be safe and effective (Milligan JF, et al, Current Concepts in Antisense 
i Drug Design, J Med Chem. (1993) 36: 1923-1937; Tonkinson JL ^ a/., Antisense 
Oligodeoxynucleotides as Clinical Therapeutic Agents, Cancer Invest. (1996) 14:54-65). 
Accordingly, in one aspect of the invention, a TAOJIK-specific nucleic acid modulator is 
used in an assay to further elucidate the role of the TAOJIK in the beta-catenin pathway, 

30 and/or its relationship to other members of the pathway. In another aspect of the 
invention, a TAOJIK-specific antisense oligomer is used as a therapeutic agent for 
treatment of beta-catehin-related disease states. 
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Assay Systems 

The invention provides assay systems and screening methods for identifying 
specific modulators of TAOUK activity. As used herein, an "assay system" encompasses 
all the components required for performing and analyzing results of an assay that detects 
5 and/or measures a particular event. In general, primary assays are used to identify or 
confirm a modulator's specific biochemical or molecular effect with respect to the 
TAOJIK nucleic acid or protein. In general, secondary assays further assess the activity of 
a TAOJIK modulating agent identified by a primary assay and may confirm that the 
modulating agent affects TAOJIK in a manner relevant to the beta-catenin pathway. In 

10 some cases, TAOJIK modulators will be directly tested in a secondary assay. 

In a preferred embodiment, the screening method comprises contacting a suitable 
assay system comprising a TAOJIK polypeptide or nucleic acid with a candidate agent 
under conditions whereby, but for the presence of the agent, the system provides a 
reference activity (e.g. kinase activity), which is based on the particular molecular event 

15 the screening method detects. A statistically significant difference between the agent- 
biased activity and the reference activity indicates that the candidate agent modulates 
TAOJIK activity, and hence the beta-catenin pathway. The TAOJIK polypeptide or 
nucleic acid used in the assay may comprise any of the nucleic acids or polypeptides 
described above. 

20 

Primary Assays 

The type of modulator tested generally determines the type of primary assay. 

Primary assays for small molecule modulators 

25 For small molecule modulators, screening assays are used to identify candidate 

modulators- Screening assays may be cell-based or may use a cell-free system that 
recreates or retains the relevant biochemical reaction of the target protein (reviewed in 
Sittampalam GS et al, Curr Opin Chem Biol (1997) 1:384-91 and accompanying 
references). As used herein the term "cell-based" refers to assays using live cells, dead 

30 cells, or a particular cellular fraction, such as a membrane, endoplasmic reticulum, or 
mitochondrial fraction. The term "cell free" encompasses assays using substantially 
purified protein (either endogenous or recombinantly produced), partially purified or crude 
cellular extracts. Screening assays may detect a variety of molecular events, including 
protein-DNA interactions, protein-protein interactions (e.g., receptor-ligand binding), 
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transcriptional activity (e.g., using a reporter gene), enzymatic activity (e.g., via a property 
of the substrate), activity of second messengers, immunogenicty and changes in cellular 
morphology or other cellular characteristics. Appropriate screening assays may use a wide 
range of detection methods including fluorescent, radioactive, colorimetric, 
5 spectrophotometry, and amperometric methods, to provide a read-out for the particular 

-7i? molecular event detected. 

Cell-based screening assays usually require systems for recombinant expression of 
:d TAOJIK and any auxiliary prbteins demanded by the particular assay. Appropriate 

methods for generating recombinant proteins produce sufficient quantities of proteins that 

10 retain their relevant biological activities and are of sufficient purity to optimize activity 
and assure assay reproducibility. Yeast two-hybrid and variant screens, and mass 
spectrometry provide preferred methods for determining protein-protein interactions and 
elucidation of protein complexes. In certain applications, when TAOJIK-interacting 
proteins are used in screens to identify small molecule modulators, the binding specificity 

15 of the interacting protein to the TAOJIK protein may be assayed by various known 
methods such as substrate processing (e.g. ability of the candidate TAGJIK-specific 
binding agents to function as negative effectors in TAOJIK-expressing cells), binding 
equilibrium constants (usually at least about 10 7 M" 1 , preferably at least about 10? M" 1 , 
more preferably at least about 10? M" 1 ), and immunogenicity (e.g. ability to elicit TAOJIK 

20 . : specific antibody in a heterologous host such as a mouse, rat, goat or rabbit), For enzymes 
and receptors, binding may be assayed by, respectively, substrate and ligand processing. 

The screening assay may measure a candidate agent's ability to specifically bind to 
or modulate activity of a TAOJIK polypeptide, a fusion protein thereof, or to cells or 
membranes bearing the polypeptide or fusion protein.. The TAOJIK polypeptide can be 

25 full length or a fragment thereof that retains functional TAOJIK activity. The TAOJIK 

tr polypeptide may be fused to another polypeptide, such as a peptide tag for detection or 

anchoring, or to another tag. The TAOJIK polypeptide is preferably human TAOJIK, or is 
an ortholog or derivative thereof as described above. In a preferred embodiment, the 
screening assay detects candidate agerit^based modulation of TAOJIK interaction with a 

30 binding target, such as an endogenous or exogenous protein or other substrate that has 
TAOJIK -specific binding activity, and ean be used to assess normal TAOJIK gene 
function; 

Suitable assay formats that may be adapted to screen for TAOJIK modulators are 
known in the art. Preferred screening assays are high throughput or ultra high throughput 
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and thus provide automated, cost-effective means of screening compound libraries for lead 
compounds (Fernandes PB, Curr Opin Chem Biol (1998) 2:597-603; Sundberg SA, Curr 
Opin Biotechnol 2000, 11:47-53). In one preferred embodiment, screening assays uses 
fluorescence technologies, including fluorescence polarization, time-resolved 
5 fluorescence, and fluorescence resonance energy transfer. These systems offer means to 
monitor protein-protein or DNA-protein interactions in which the intensity of the signal 
emitted from dye-labeled molecules depends upon their interactions with partner 
molecules {e.g., Selvin PR, Nat Struct Biol (2000) 7:730-4; Fernandes PB, supra; 
Hertzberg RP and Pope AJ, Curr Opin Chem Biol (2000) 4:445-451). 
10 A variety of suitable assay systems may be used to identify candidate TAOJIK and 

beta-catenin pathway modulators (e.g. U.S. Pat. No. 6,165,992 (kinase assays); U.S. Pat. 
Nos. 5,550,019 and 6,133,437 (apoptosis assays); U.S. Pat.Nos. 5,976,782, 6,225,118 and 
6,444,434 (angiogenesis assays), among others). Specific preferred assays are described 
in more detail below. 

15 

Kinase assays. In some preferred embodiments the screening assay detects the 
ability of the test agent to modulate the kinase activity of a TAOJIK polypeptide. In 
further embodiments, a cell-free kinase assay system is used to identify a candidate beta- 
catenin modulating agent, and a secondary, cell-based assay, such as an apoptosis or 

20 hypoxic induction assay (described below), may be used to further characterize the 

candidate beta-catenin modulating agent Many different assays for kinases have been 
reported in the literature and are well known to those skilled in the art (e.g. U.S. Pat. No. 
6,165,992; Zhu et al., Nature Genetics (2000) 26:283-289; and WO0073469). 
Radioassays, which monitor the transfer of a gamma phosphate are frequently used. For 

25 instance, a scintillation assay for p56 (lck) kinase activity monitors the transfer of the 

gamma phosphate from gamma - 33 P ATP to a biotinylated peptide substrate; the substrate 
is captured on a streptavidiri coated bead that transmits the signal (Be veridge M et al. , J 
Biomol Screen (2000) 5:205-212). This assay uses the scintillation proximity assay 
(SPA), in which only radio-ligand bound to receptors tethered to the surface of an SPA 

30 bead are detected by the scintillant immobilized within it, allowing binding to be measured 
without separation of bound from free ligand. 

Other assays for protein kinase activity may use antibodies that specifically 
recognize phosphorylated substrates. For instance, the kinase receptor activation (KIRA) 
assay measures receptor tyrosine kinase activity by ligand stimulating the intact receptor 
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in cultured cells, then capturing solubilized receptor with specific antibodies and 
quantifying phosphorylation via phosphotyrosine ELISA (Sadick MD, Dev Biol Stand 
(1999)97:121-133). 

Another example of antibody based assays for protein kinase activity is TRF (time- 
5 resolved fluprometry). This method utilizes europium chelate-labeled anti- 

phosphotyrosine antibodies to detect phosphate transfer to a polymeric substrate coated 
onto microtiter plate wells. The amount of phosphorylation is then detected using time- 
resolved, dissociation-enhanced fluorescence (Braunwalder AF, et al., Anal Biochem 1996 
Jul 1;238(2): 159-64). 

• 10 

a Apoptosis assays. Assays for apoptosis may be performed by terminal 

deoxynucleotidyl transferase-mediated digoxigenin-1 1-dUTP nick end labeling (TUNEL) 
assay. The TUNEL assay is used to measure nuclear DN A fragmentation characteristic of 
apoptosis ( Lazebnik et al, 1994* Nature 371, 346), by following the incorporation of 
15 fluorescein-dUTP (Yonehara et al., 1989, J. Exp. Med. 169, 1747). Apoptosis may further 
be assayed by acridine orange staining of tissue culture cells (Lucas, R., et al., 1998, Blood 
i 15:4730-41). An apoptosis assay system may comprise a cell that expresses a TAOJIK, 

and that optionally has defective beta-catenin function (e.g. beta-catenin is over-expressed 
or under-expressed relative to wild-type cells). A test agent can be added to the apoptosis 
20 assay system and changes in induction of apoptosis relative to controls where no test agent 
is added, identify candidate beta-catenin modulating agents . In some embodiments of the 
invention, an apoptosis assay may be used as a secondary assay to test a candidate beta- 
catenin modulating agents that is initially identified using a cell-free assay system. An 
apoptosis assay may also be used to test whether TAOJIK function plays a direct role in 

1 25 apoptosis. For example, an apoptosis assay may be performed on cells that oyer- or under- 
express TAOJIK relative to wild type cells, differences in apoptotic response compared 
to wild type cells suggests that: the TAOJIK plays a direct role in the apoptotic response. 
Apoptosis assays are described further in US Pat. No. 6,133,437. 

t 30 Cell proliferation and cell cycle assays. Cell proliferation may be assayed via 

bromodeoxyuridine (BRDU) incorporation. This assay identifies a cell population 
undergoing DNA synthesis by incorporation of BRDU into newly-synthesized DNA. 
Newly-synthesized DNA may then be detected using an anti-BRDU antibody (Hoshino et 
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a/., 1986, Int. J. Cancer 38, 369; Campana etaL, 1988, J. Immunol. Meth. 107, 79), or by 
other means. 

Cell Proliferation may also be examined using [ 3 H] -thymidine incorporation 
(Chen, J., 1996, Oncogene 13:1395-403; Jeoung, J., 1995, J. Biol. Chem. 270:18367-73). 
5 This assay allows for quantitative characterization of S-phase DNA syntheses. In this 
assay, cells synthesizing DNA will incorporate [H] -thymidine into newly synthesized 
DNA. Incorporation can then be measured by standard techniques such as by counting of 
radioisotope in a scintillation counter (e.g., Beckman LS 3800 Liquid Scintillation 
Counter). Another proliferation assay uses the dye Alamar Blue (available from 

10 Biosource International), which fluoresces when reduced in living cells and provides an 
indirect measurement of cSU number (Voytik-Harbin SL et al., 1998, In Vitro Cell Dev 
Biol Anim 34:239-46). 

Cell proliferation may also be assayed by colony formation in soft agar (Sambrook 
et al., Molecular Cloning, Cold Spring Harbor (1989)). For example, cells transformed 

15 with TAOJIK are seeded in soft agar plates, and colonies are measured and counted after 
two weeks incubation. 

Involvement of a gene in the cell cycle may be assayed by flow cytometry (Gray 
JW et al. (1986) Int J Radiat Biol Relat Stud Phys Chem Med 49:237-55). Cells 
transfected with a TAOJIK may be stained with propidium iodide and evaluated in a flow 

20 cytometer (available from Becton Dickinson), which indicates accumulation of cells in 
different stages of the cell cycle. 

Accordingly, a cell proliferation or cell cycle assay system may comprise a cell 
that expresses a TAOJIK, and that optionally has defective beta-catenin function (e.g. 
beta-catenin is over-expressed or under-expressed relative to wild-type cells). A test agent 

25 can be added to the assay system and changes in cell proliferation or cell cycle relative to 
controls where no test agent is added, identify candidate beta-catenin modulating agents. 
In some embodiments of the invention, the cell proliferation or cell cycle assay may be 
used as a secondary assay to test a candidate beta-catenin modulating agents that is 
initially identified using another assay system such as a cell-free kinase assay system. A 

30 cell proliferation assay may also be used to test whether TAOJIK function plays a direct 
role in cell proliferation or cell cycle. For example, a cell proliferation or cell cycle assay 
may be performed on cells that over- or under-express TAOJIK relative to wild type cells. 
Differences in proliferation or cell cycle compared to wild type cells suggests that the 
TAOJIK plays a direct role in cell proliferation or cell cycle. 
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Angiogenesis. Angiogenesis may be assayed using various human endothelial cell 
systems, such as umbilical vein, coronary artery, or dermal cells. Suitable assays include 
Alamar Blue based assays (available from Biosource International) to measure 
proliferation; migration assays using fluorescent molecules, such as the use of Becton 
5 Dickinson Falcon HTS FluoroBlock cell culture inserts to measure migration of cells 

- ^ through membranes in presence or absence of angiogenesis enhancer or suppressors; and 
tubule formation assays based on the formation of tubular structures by endothelial cells 

.-K on Matrigel® (Becton Dickinson). Accordingly, an angiogenesis assay system may 

comprise a cell that expresses a TAOJIK, and that optionally has defective beta-catenin 

10 function (e.g. beta-catenin is over-expressed or under-expressed relative to wild-type 
cells). A test agent can be added to the angiogenesis assay system and changes in 
angiogenesis relative to controls where no test agent is added, identify candidate beta- 

^ catenin modulating agents. In some embodiments of the invention, the angiogenesis assay 
may be used as a secondary assay to test a candidate beta-catenin modulating agents that is 

15 initially identified using another assay system. An angiogenesis assay may also be used to 
test whether TAOJIK function plays a direct role in cell proliferation. . For example, an 
angiogenesis assay may be performed on cells that over- or under-express TAOJIK 
relative to wild type cells. Differences in angiogenesis compared to wild type cells 
suggests that the TAOJIK plays a direct role in angiogenesis. U.S. Pat. Nos. 5,976,782, 

20 6,225*118 and 6,444,434, among others. 

Hypoxic induction. The alpha subunit of the transcription factor, hypoxia 
inducible factor- 1 (HIF-1), is upregulated in tumor cells following exposure to hypoxia in 
vitro. Under hypoxic conditions, HIFrl stimulates the expression of genes known to be 
important in tumour cell survival, such as those encoding glyolytic enzymes and VEGF. 
Induction of such genes by hypoxic conditions may be assayed by growing cells 
transfected with TAOJIK in hypoxic conditions (such as with 0.1% 02, 5% CQ2, and 
balance N2, generated in a Napco 7001 incubator (Precision Scientific)) and normoxic 
conditions, followed by assessment of gene activity or expression by Taqman®. For 
exampie v a hypoxic induction assay system may comprise a cell that expresses a TAOJIK, 
and that optionally has defective beta-catenin function (e.g. beta-catenin is over-expressed 
or under-expressed relative to wild-type cells); A test agent can be added to the hypoxic 
induction assay system and changes in hypoxic response relative to controls where no test 
agent is added, identify candidate beta-catenin modulating agents. In some embodiments 
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of the invention, the hypoxic induction assay may be used as a secondary assay to test a 
candidate beta-catenin modulating agents that is initially identified using another assay 
system. A hypoxic induction assay may also be used to test whether TAOJIK function 
plays a direct role in the hypoxic response. For example, a hypoxic induction assay may 
5 be performed on cells that over- or under-express TAOJIK relative to wild type cells. 
Differences in hypoxic response compared to wild type cells suggests that the TAOJIK 
plays a direct role in hypoxic induction. 

Cell adhesion. Cell adhesion assays measure adhesion of cells to purified 

10 adhesion proteins, or adhesion of cells to each other, in presence or absence of candidate 
modulating agents. Cell-protein adhesion assays measure the ability of agents to modulate 
the adhesion of cells to purified proteins. For example, recombinant proteins are 
produced, diluted to 2.5g/mL in PBS, and used to coat the wells of a microliter plate. The 
wells used for negative control are not coated. Coated wells are then washed, blocked 

15 with 1% BSA, and washed again. Compounds are diluted to 2x final test concentration 

and added to the blocked, coated wells. Cells are then added to the wells, and the unbound 
cells are washed off. Retained cells are labeled directly on the plate by adding a 
membrane-permeable fluorescent dye, such as calcein-AM, and the signal is quantified in 
a fluorescent microplate reader. 

20 Cell-cell adhesion assays measure the ability of agents to modulate binding of cell 

adhesion proteins with their native ligands. These assays use cells that naturally or 
recombinantly express the adhesion protein of choice. In an exemplary assay, cells 
expressing the cell adhesion protein are plated in wells of a multiwell plate. Cells 
expressing the ligand are labeled with a membrane-permeable fluorescent dye, such as 

25 BCECF , and allowed to adhere to the monolayers in the presence of candidate agents. 
Unbound cells are washed off, and bound cells are detected using a fluorescence plate 
reader. 

High-throughput cell adhesion assays have also been described. In one such assay, 
small molecule ligands and peptides are bound to the surface of microscope slides using a 
30 microarray spotter, intact cells are then contacted with the slides, and unbound cells are 
washed off. In this assay, not only the binding specificity of the peptides and modulators 
against cell lines are determined, but also the functional cell signaling of attached cells 
using immunofluorescence techniques in situ on the microchip is measured (Falsey JR et 
al., Bioconjug Chem. 2001 May-Jun;12(3):346-53). 
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Tubulogenesis. Tubulogenesis assays monitor the ability of cultured cells, 
generally endothelial cells, to form tubular structures on a matrix substrate, which 
generally simulates the environment of the extracellular matrix. Exemplary substrates 
include Matrigel™ (Becton Dickinson), an extract of basement membrane proteins " 
5 containing laminin, collagen IV, and heparin sulfate proteoglycan, which is liquid at 4° C 
: - - and forms a solid gel at 37° C. Other suitable matrices comprise extracellular components 
such as collagen, fibronectin, and/or fibrin. Cells are stimulated with a pro-angiogenic 
stimulant, and their ability to form tubules is detected by imaging. Tubules can generally 
be detected after an overnight incubation with stimuli* but longer or shorter time frames 

10 - may also be used. Tube formation assays are well known in the art (e.g., Jones MK et al., 
1999, Nature Medicine 5; 1418-1423). These assays hay? traditionally involved 
stimulation with serum or with the growth factors FGF or VEGF. Serum represents an 
undefined source of growth factors. In a preferred embodiment, the assay is performed 
with cells cultured in serum free medium, in order to control "which process or pathway a 

15 candidate agent modulates. Moreover, we have found that different target genes respond 
differently to stimulation with different pro-angiogenic agents, including inflammatory 
angiogenic factors such as TNF-alpa. Thus, in a further preferred embodiment; a 
tubulogenesis assay system comprises testing a TAOJIK's response to a variety of factors, . 
such as FGF, VEGF, phorbol myristate acetate (PMA), TNF-alpha, ephrin, etc. 

20 . •- • ... ... -. . . •• . • 

Cell Migration. An invasion/migration assay (also called a migration assay) tests 
the ability of cells to overcome a physical .barrier and to migrate towards pro-angiogenic 
signals. Migration assays are known in the art (e.g., Eaik JH et al. , 200 1 , J B ipl Chem 
276:1 1830-1 1837). In a typical experimental set-up, cultured endothelial cells are seeded 

25 onto a matrix-coated porous lamina, with pore sizes generally smaller .than typical cell 
size.:" The matrix generally simulates the enviromnent jof the extracellular matrix, as 
described above. The lamina is typically a membrane, such as the transwell polycarbonate 
membrane (Corning Cos tar Corporation,, Cambridge^ MA), and is generally part of an 
upper chamber that is in fluid contact with a lower chamber containing pro-angiogenic 

: 30 stimuli. Migration is generally assayed after an overnight incubation with stimuli, but 
longer or shorter time frames may also be used. Migration is assessed as the number of 
cells that crossed the lamina, and may be detected by staining cells with hemotoxylin 
solution (VWR Scientific, South San Francisco, CA), or by any. other method for 
determining cell number. In another exemplary set up, cells are fluorescently labeled and 
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migration is detected using fluorescent readings, for instance using the Falcon HTS 
FluoroBlok (Becton Dickinson). While some migration is observed in the absence of 
stimulus, migration is greatly increased in response to pro-angiogenic factors. As 
described above, a preferred assay system for migration/invasion assays comprises testing 
5 a TAOJUCs response to a variety of pro-angiogenic factors, including tumor angiogenic 
and inflammatory angiogenic agents, and culturing the cells in serum free medium. 

Sprouting assay* A sprouting assay is a three-dimensional in vitro angiogenesis 
assay that uses a cell-number defined spheroid aggregation of endothelial cells 

10 ("spheroid"), embedded in a collagen gel-based matrix. The spheroid can serve as a 
starting point for the sprouting of capillary-like structures by invasion into the 
extracellular matrix (termed "ceil sprouting") and the subsequent formation of complex 
anastomosing networks (Korff and Augustin, 1999, J Cell Sci 112:3249-58). In an 
exemplary experimental set-up, spheroids are prepared by pipetting 400 human umbilical 

15 vein endothelial cells into individual wells of a nonadhesive 96- well plates to allow 

overnight spheroidal aggregation (Korff and Augustin: J Cell Biol 143: 1341-52, 1998). 
Spheroids are harvested and seeded in 900/xl of methocel-collagen solution and pipetted 
into individual wells of a 24 well plate to allow collagen gel polymerization. Test agents 
are added after 30 min by pipetting 100 fil of 10-fold concentrated working dilution of the 

20 test substances on top of the gel. Plates are incubated at 37°C for 24h. Dishes are fixed at 
the end of the experimental incubation period by addition of paraformaldehyde. Sprouting 
intensity of endothelial cells can be quantitated by an automated image analysis system to 
determine the cumulative sprout length per spheroid. 



25 Primary assays for antibody modulators 

For antibody modulators, appropriate primary assays test is a binding assay that 
tests the antibody's affinity to and specificity for the TAOJIK protein. Methods for testing 
antibody affinity and specificity are well known in the art (Harlow and Lane, 1988, 1999, 
supra). The enzyme-linked immunosorbant assay (ELISA) is a preferred method for, 
30 detecting TAOJIK-specific antibodies; others include FACS assays, radioimmunoassays, 
and fluorescent assays. 

In some cases, screening assays described for small molecule, modulators may also 
be used to test antibody modulators. 
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Primary assays for nucleic acid modulators 

For nucleic acid modulators, primary assays may test the ability of the nucleic acid 
modulator to inhibit or enhance TAOJIK gene expression, preferably mRNA expression. 
In general, expression analysis comprises comparing TAOJIK expression in like 
populations of cells (e.g., two pools of cells that endogenously or recombinantly express 
TAOJIK) in the presence and absence of the nucleic acid modulator. Methods for 
analyzing mRNA and protein expression are well known in the art. For instance, Northern 
blotting, slot blotting* ribonuclease protection, quantitative RT-PCR (e.g., using the 
TaqMan®, PE Applied Biosystems), or microarray analysis may be used to confirm that 
TAOJIK mRNA expression is reduced in cells treated with the nucleic acid modulator 
(e.g., Current Protocols in Molecular Biology (1994) Ausubel FM et al. y eds. y John Wiley 
& Sons, Inc., chapter 4; Freeman WM et al. , Biotechniques (1999) 26: 1 1 2-125; 
Kallioniemi OP, Ann Med 2001, 33:142-147; Blohm DH and Guiseppi-Elie, A Curr Opin 
Biotechnol 2001, 12:41-47). Protein expression may also be" monitored. Proteins are most 
commonly detected with specific antibodies or antisera directed against either the TAOJIK 
protein or specific peptides. A variety of means including Western blotting, ELISA, or in 
situ detection, are available (Harlow E and Lane D* 1988 and 1999, supra). 

In some cases, screening assays described for small molecule modulators, 
particularly in assay systems that involve TAOJIK mRNA expression, may also be used to 
test nucleic acid modulators: 

Secondary Assays • . . 

Secondary assays may be used to further assess the activity of TAOJIK-modulating 
agent identified by any of the above methods to confirm that the modulating agent affects 
25 TAOJIK in a manner relevant to the beta-catenin pathway. As used herein, TAOJIK- 
modulating agents encompass candidate clinical compounds or other agents derived from 
previously identified modulating agent. Secondary assays can also be used to test the 
activity of a modulating agent on a particular genetic or biochemical pathway or to test the 
specificity of the modulating agent's interaction with TAOJIK. 
30 Secondary assays generally compare like populations of cells or animals (e.g., two 

pools of cells or animals that endogenously or recombinantly express TAOJIK) in the 
presence and absence of the candidate modulator. In general, such assays test whether 
treatment of cells or animals with a candidate TAOJIK-modulating agent results in 
changes in the beta-catenin pathway in comparison to untreated (or mock- or placebo- 
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treated) cells or animals. Certain assays use "sensitized genetic backgrounds", which, as 
used herein, describe cells or animals engineered for altered expression of genes in the 
beta-catenin or interacting pathways. 

5 Cell-based assays 

Cell based assays may detect endogenous beta-catenin pathway activity or may 
rely on recombinant expression of beta-catenin pathway components. Any of the 
aforementioned assays may be used in this cell-based format. Candidate modulators are 
typically added to the cell media but may also be injected into cells or delivered by any 
10 other efficacious means. 

Animal Assays 

A variety of non-human animal models of normal or defective beta-catenin 
pathway may be used to test candidate TAOJIK modulators. Models for defective beta- 

15 catenin pathway typically use genetically modified animals that have been engineered to 
mis-express (e.g., over-express or lack expression in) genes involved in the beta-catenin 
pathway. Assays generally require systemic delivery of the candidate modulators, such as 
by oral administration, injection, etc. 

In a preferred embodiment, beta-catenin pathway activity is assessed by 

20 monitoring neovascularization and angiogenesis. Animal models with defective and 
normal beta-catenin are used to test the candidate modulator's affect on TAOJIK in 
Matrigel® assays. Matrigel® is an extract of basement membrane proteins, and is 
composed primarily of laminin, collagen IV, and heparin sulfate proteoglycan. It is 
provided as a sterile liquid at 4° C, but rapidly forms a solid gel at 37° C. Liquid 

25 Matrigel® is mixed with various angiogenic agents, such as bFGF and VEGF, or with 
human tumor cells which over-express the TAOJIK. The mixture is then injected 
subcutaneously(SC) into female athymic nude mice (Taconic, Germantown, NY) to 
support an intense vascular response. Mice with Matrigel® pellets may be dosed via oral 
(PO), intraperitoneal (IP), or intravenous (IV) routes with the candidate modulator. Mice 

30 are euthanized 5-12 days post-injection, and the Matrigel® pellet is harvested for 

hemoglobin analysis (Sigma plasma hemoglobin kit). Hemoglobin content of the gel is 
found to correlate the degree of neovascularization in the gel. ' 

In another preferred embodiment, the effect of the candidate modulator on 
TAOJIK is assessed via tumorigenicity assays. Tumor xenograft assays are known in the 
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art (see, e.g., Ogawa K et al., 2000, Oncogene 19:6043-6052). Xenografts are typically 
implanted SC into female athymic mice, 6-7 week old, as single cell suspensions either 
from a pre-existing tumor or from in vitro culture. The tumors which express the TAOJIK 
endogenously are injected in the flank, 1 x 10 5 to 1 x 10 7 cells per mouse in a volume of 
5 100 |jL using a 27gauge needle. Mice are then ear tagged and tumors are measured twice 
■v„„ weekly. Candidate modulator treatment is initiated on the day the mean tumor weight. 

reaches 100 mg. Candidate modulator is delivered IV, SC, DP, or PO by bolus 
v. administration. Depending upon the pharmacokinetics of each unique candidate 

modulator, dosing can be performed multiple times per day. The tumor weight is assessed 
10 by measuring perpendicular diameters with a caliper and calculated by multiplying the 

measurements of diameters in two dimensions. At the end of the experiment, the excised 
tumors maybe utilized for biomarker identification or further analyses . For 
immunohistochemistry staining, xenograft tumors are fixed in 4% paraformaldehyde, 
0.1M phosphate, pH 7.2, for 6 hours at 4°C, immersed in 30% sucrose in PBS, and rapidly 
15 frozen in isopentane cooled with liquid nitrogen. 

In another preferred embodiment, tumorogenieity is monitored using a hollow fiber 
assay, which is described in U.S. Pat No. US 5,698*413. Briefly, the method comprises 
implanting into a laboratory animal a biocompatible, semipermeable encapsulation device 
containing target cells, treating the laboratory animal with a candidate modulating agent, 
20 and evaluating the target cells for reaction to the candidate modulator. Implanted cells are 
generally human cells from a pre-existing tumor or a tumor cell line. After an appropriate 
period of time, generally around six days, the implanted samples are harvested for 
evaluation of the candidate modulator. Tumorogenieity and modulator efficacy may be 
evaluated by assaying the quantity of viable cells present in the macrocapsule, which can 
25 - be determined by tests known in the art, for example, MTT dye conversion assay, neutral 
^ red dye uptake, trypan blue staining, viable cell counts, the number of colonies formed in 
v v soft agar, the capacity of the cells to recover and replicate in vitro, etc. 

- In another preferred embodiment, a tumorogenieity assay use a transgenic animal, 
usually a mouse, carrying a dominant oncogene or tumor suppressor gene knockout under 
30- the control of tissue specific regulatory sequences; these assays are generally referred to as 
transgenic tumor assays. In a preferred application, tumor development in the transgenic 
- model is well characterized or is controlled. In an exemplary model, the M Rn > l-Tag2" 
transgene, comprising the SV40 large T-antigen oncogene under control of the insulin 
gene regulatory regions is expressed in pancreatic beta cells and results in islet cell 
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carcinomas (Hanahan D, 1985, Nature 315:115-122; Parangi S et al, 1996, Proc Nad Acad 
Sci USA 93: 2002-2007; Bergers G et al, 1999, Science 284:808-812). An "angiogenic 
switch," occurs at approximately five weeks, as normally quiescent capillaries in a subset 
of hyperproliferative islets become angiogenic. The RIP1-TAG2 mice die by age 14^ 
5 weeks. Candidate modulators may be administered at a variety of stages, including just 
prior to the angiogenic switch (e.g., for a model of tumor prevention), during the growth of 
small tumors (e.g., for a model of intervention), or during the growth of large and/or 
invasive tumors (e.g., for a model of regression). Tumorogenicity and modulator efficacy 
can be evaluating life-span extension and/or tumor characteristics, including number of 
10 tumors, tumor size, tumor morphology, vessel density, apoptotic index, etc. 

Diagnostic and therapeutic uses 

Specific TAOJIK-modulating agents are useful in a variety of diagnostic and 
therapeutic applications where disease or disease prognosis is related to defects in the 

15 beta-catenin pathway, such as angiogenic, apoptotic, or cell proliferation disorders. 

Accordingly, the invention also provides methods for modulating the beta-catenin pathway 
in a cell, preferably a cell pre-determined to have defective or impaired beta-catenin 
function (e.g. due to overexpression, underexpression, or misexpression of beta-catenin, or 
due to gene mutations), comprising the step of administering an agent to the cell that 

20 specifically modulates TAOJIK activity. Preferably, the modulating agent produces a 
detectable phenotypic change in the ceil indicating that the beta-catenin function is 
restored. The phrase "function is restored", and equivalents, as used herein, means that 
the desired phenotype is achieved, or is brought closer to normal compared to untreated 
cells. For example, with restored beta-catenin function, cell proliferation and/or 

25 progression through cell cycle may normalize, or be brought closer to normal relative to 
untreated cells. The invention also provides methods for treating disorders or disease 
associated with impaired beta-catenin function by administering a therapeutically effective 
amount of a TAOJIK -modulating agent that modulates the beta-catenin pathway. The 
invention further provides methods for modulating TAOJIK function in a cell, preferably a 

30 cell pre-determined to have defective or impaired TAOJIK function, by administering a 
TAOJIK -modulating agent. Additionally, the invention provides a method for treating 
disorders or disease associated with impaired TAOJIK function by administering a 
therapeutically effective amount of a TAOJIK -modulating agent. 
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The discovery that TAOJIK is implicated in beta-catenin pathway provides for a 
variety of methods that can be employed for the diagnostic and prognostic evaluation of 
diseases and disorders involving defects in the beta-catenin pathway and for the 
identification of subjects having a predisposition to such diseases and disorders. 
5 Various expression analysis methods can be used to diagnose whether TAOJIK 

„ ■ expression occurs in a particular sample, including Northern blotting, slot blotting, 
ribonuclease protection, quantitative RT-PCR, and microarray analysis, (e.g., Current 
Protocols in Molecular Biology (1994) Ausubel FM et al., eds:> John Wiley & Sons, Inc., 
chapter 4; Freeman WM etal;, Biotechniques (1999) 26:112-125; Kallioniemi OP, Ann 

10 Med 2001, 33: 142-147; Blohm and Guiseppi-Elie, Curr Opin Biotechnol 2001, 12:41-47). 
Tissues having a disease or disorder implicating defective beta-catenin signaling that 
express a TAOJIK, are identified as amenable to treatment with a TAOJIK modulating 
V; agent. In a preferred application, the beta-catenin defective tissue overexpresses a 

TAOJIK relative to normal tissue. For example, a Northern blot analysis of mRNA from 

15 tumor and normal cell lines, or from tumor and matching normal tissue samples from the 
same patient, using full or partial TAOJIK cDNA sequences as probes, can determine 
whether particular tumors express or overexpress TAOJIK. Alternatively, the TaqMan® 
is used for quantitative RT-PCR analysis of TAOJIK expression in cell lines, normal 
tissues and tumor samples (PE Applied Biosystems). 

20 Various other diagnostic methods may be performed* for example, utilizing 

reagents such as the TAOJIK oligonucleotides, and antibodies directed against a TAOJIK, 
as described above for: (1) the detection of the presence of TAOJIK gene mutations, or the 
detection of either over- or under-expression of TAOJIK mRNA relative to the non- 
disorder state; (2) the detection of either an over- or an under-abundance of TAOJIK gene 

25 product relative to the non-disorder state; and (3) the detection of perturbations or 
; abnormalities in the signal transduction pathway mediated by TAOJIK. 

Thus, in a specific embodiment, the invention is drawn to a method for diagnosing 
a disease or disorder in a patient that is associated with alterations in TAOJIK expression, 
the method comprising:, a) obtaining a biological sample from the patient; b) contacting 

30 the sample with a probe for TAOJIK expression; c) comparing results from step (b) with a 
control; and d) determining whether step (c) indicates a likelihood of the disease or 
disorder. Preferably, the disease is cancer, most preferably a cancer as shown in TABLE 
2, or indicated as a result of immunohistochemistry analysis (Example VII). The probe 
may be either DNA or protein, including an antibody. 
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EXAMPLES 

The following experimental section and examples are offered by way of illustration 
and not by way of limitation. 

5 I. C. elegans beta-catenin screen 

The identification of mutants that suppress the cell adhesion defect of beta-catenin 
may lead to unique therapeutic targets that inhibit cell migration or metastasis, hrnp-2 was 
initially identified in an EMS screen for defects in body elongation during embryonic 
morphogenesis (see Costa et <aZ., (1998) The Journal of Cell Biology 1998, 141: 297-308). 

10 llie loss of function allele hmp-2 (zu364) exhibits 99% embryonic lethality, with mutant 
embryos arresting during elongation and abnormal bulges forming on the dorsal side. 
About 1% of these embryos hatch to form viable lumpy larvae. The reduction of function 
allele hmp-2 (qm39) yields viable larvae with a characteristic lumpy appearance. When 
grown at 15°C, approximately 92% (SD 3.9) of the LI larvae show this lumpy phenotype, 

15 with the penetrance of the phenotype decreasing as the animals molt and move through 
successive larval stages. For this screen, hmp-2 (qm39) worms were soaked at 15°C in 
double stranded RNA (dsRNA) at the L4 larval stage and the progeny were scored as LI 
larvae for modification of the adhesion defect. The screen protocol is described below. 

20 1) hmp-2 (qni39) animals were bleached and hatched on peptone free agarose 

plates to produce a synchronous population. Starved Lis were transferred to lOx peptone 
plates seeded with 750 fil OP50 (25% w/v in TB) and allowed to develop to the L4 larval 
stage. 

2) dsRNA was dispensed in 6 \i\ aliquots into 96 well round bottom plates 
25 (Nunc #262162). L4 animals were collected by suspension in M9 buffer, washed 2x with 

M9 to remove any excess OP50, and dispensed in 2 \i\ aliquots into the RNA to a total 
worm density of 75-100 worms per well. As a control, multiple wells contained only 
RNA resuspension buffer (lx IM buffer). 

3) Animals were soaked in dsRNA at 15°C for 24 hours. 

30 4) Following dsRNA soaking, the animals were fed in the wells by addition of 

25pl liquid NGM + 3% OP50. The animals were kept at 15°C and allowed to become 
gravid and lay progeny in the wells, which took approximately 72 hours. Food levels were 
monitored visually during maturation and more was added as needed. 
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5) Following maturation, animals from each well were plated onto individual 
6cm peptone free agarose plates and placed at 1 5 °C overnight. 

6) Animals on each plate were scored visually under the dissecting 
microscope for modification of the lumpy phenotype. Scoring was performed 

5 qualitatively, with an increase in dead embryos scored as enhancement and an increase in 
wild type appearing animals scored as suppression of the defect. 

7) Retests of interesting suppressor candidates followed the same protocol as 
the primary screen with certain modifications: several retests were performed for each 
suppressor, retested candidates were encoded so that they could be scored blindly, and 

10 retested candidates were scored quantitatively. Each plate was scored by counting 100 
total objects. An object was defined as either an embryo .or an LI stage larva. Each object 
was scored as one of the following: a wildtype appearing animal, a lumpy appearing 
animal^ or an unhatched embryo. Scores were represented as the percentage of wildtype 
appearing animals relative to all objects scored. Wildtype airimals were defined as LI 

15 larvae with smooth cuticles that did not have any sort of lumpy body morphology. 

8) A confirmed suppressor was one that was > 2 standard deviations away 
from the mean of the controls for at least 3 of the four retest experiments. 

BLAST analysis (Altschul et al., supra) was employed to identify orthologs of the 
C. elegans modifiers. For example, representative sequences from TADJIK, GI# 4759208 
20 (SEQ ED NO: 16), GI# 7705560 (SEQ ID NO: 17) and GI#7243 103 (SEQ ED NO: 1 8) share 
37%, 38%, and 38% amino acid identity, respectively, with the C. elegans T17E9.1. 

Various domains, signals, and functional subunits in proteins were analyzed using 
the PSORT (Nakai K., and Horton P., Trends Biochem Sei, 1999, 24:34-6; Kenta Nakai, 
Protein sorting signals and prediction of subcellular localization, Adv. Protein Ghem. 54, 
25 277-344 (2000)), PFAM (Bateman A., et al., Nucleic Acids Res, 1999, 27:260-2), 

SMART (Ponting CP, et al., SMART: identification and annotation of domains from 
signaling and extracellular protein sequences. Nucleic Acids Res. 1999 Jan 1;27(1):229- 
32), TM-HMM (Erik L.L. Sonnhammer, Gunnar von Heijne, and Anders Krogh: A hidden 
Markov model for predicting transmembrane helices in protein sequences. In Proc. of 
30 Sixth Int. Conf . on Intelligent Systems for Molecular Biology, -p 175-182 Ed J. Glasgow, 
T. Littlejbhn, F. Major, R. Lathrop, Di Sankoff , and C. Senseii Menlo Park, CA: AAA! 
Press, 1998), and clust (Remm M, and Sonnhammer E. Classification of transmembrane 
protein families in the Caenorhabditis elegans genome and identification of human 
orthologs. Genome Res. 2000 Nov; 10(11): 1679-89) programs. For example, the kinase 
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(PFAM 00069) domain of TAOJIK from GI#s 4759208, 7705560, and 7243103 (SEQ ID 
NOs:16, 17, and 18, respectively) are located at approximately amino acid residues 28 to 
281, 24 to 277, and 32 to 285, respectively. 
Results: 

5 Numbers shown in Table 1 are the percentage of wild type appearing animals 

relative to all animals (wtild-type, lumpy and embryos) scored per experiment. Control 
replicates were performed for each retest (#1-4) and the standard deviation for each 
control is listed beneath the average. 

10 Table 1 



Gene ID 


Retest #1 


Retest #2 


Retest #3 


Retest #4 


Mean 


Deviation 


T17E9.1 flan-18) 


29 


23 


36 


10 


24.5 


11.0302614 


Controls average 


9.4 


9.4 


8.1 


8.1 






Deviation 


3.5 


4.1 


3.9 


3.9 







n. RNAi of C. elegans T17E9.1 

T17E9.1/kin-18 (the C. elegans ortholog of human kinases TAOl, JIK, and 
15 KIAA1361) was initially identified in an RNAi gene inactivation screen as suppressors of 
a beta-catenin (hmp-2) lumpy body mutant phenotype resulting from reduced beta-catenin 
function in cytoskeletal organization at cell-cell adhesive junctions (Example I). 
Subsequent testing demonstrated that RNAi inactivation of the C. elegans gene also 
partially suppressed an axin (pry-1) developmental arrest mutant phenotype that appears to 
20 result from increased beta-catenin (bar-1) and TCF (pop-1) transcriptional activation 
function. 

in. High-Throughput In Vitro Fluorescence Polarization Assay 

Fluorescently-labeled TAOJIK peptide/substrate are added to each well of a 96- 
25 well microliter plate, along with a test agent in a test buffer (10 mM HEPES, 10 mM 
NaCl, 6 mM magnesium chloride, pH 7.6). Changes in fluorescence polarization, 
determined by using a Fluorolite FPM-2 Fluorescence Polarization Microtiter System 
(Dynatech Laboratories, Inc), relative to control values indicates the test compound is a 
candidate modifier of TAOJIK activity. 

30 
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IV. High-Throughput In Vitro Binding Assay. 

3? P-labeled TAOJIK peptide is added in an assay buffer (100 mM KC1, 20 mM 
HEPES pH 7.6, 1 mMMgCb, 1% glycerol, 0.5% NP-40, 50 mM beta-mercaptoethanol, 1 
mg/ml BSA, cocktail of protease inhibitors) along with a test agent to the wells of a 
5 Neutralite-avidin coated assay plate and incubated at 25 °C for 1 hour. Bio tiny lated 

substrate is then added to each well and incubated for 1 hour. Reactions are stopped by 
washing with PBS, and counted in a scintillation counter. Test agents that cause a 
difference in activity relative to control without test agent are identified as candidate beta- 
; cateniri modulating agents. 

10 

V. Imniunoprecimtations and Immunoblotting 

For coprecipitation of transfected proteins, 3 x 10 6 appropriate recombinant cells 
containing the TAOJIK proteins are plated on 10-cm dishes and transfected on the 
following day with expression constructs. The total amount of DNA is kept constant in 

15 each trarisfection by adding empty vector. After 24 h, cells are collected, washed once 
with phosphate-buffered saline and lysed for 20 min on ice in 1 ml of lysis buffer 
containing 50 mM Hepes, pH 7.9, 250 mM NaCl, 20 mM -glycerophosphate, 1 mM 
sodium ortho vanadate, 5 mM p-nitrbphenyl phosphate, 2 mM dithiothreitbl, protease 
I inhibitors (complete, Roche Molecular Biochemicals), and 1% Nonidet P-40. Cellular 

20 debris is removed by centrifugation twice at 15,000 x g for 15 min. The cell lysate is 
incubated with 25 /il of M2 beads (Sigma) for 2 h at 4 °C with gentle rocking. 

After extensive washing with lysis buffer, proteins bound to the beads are 
solubilized by boiling in SDS sample buffer, fractionated by SDS-polyaerylamide gel 
electrophoresis, transferred to polyvinylidene difluoride membrane and blotted with the 

25 indicated antibodies. The reactive bands are visualized with horseradish peroxidase 

coupled io the appropriate secondary antibodies and the enhanced chemiluminescence 
(ECL) Western blotting detection sysitem (Amersham Pharmacia Biotech). 

VI. Kinase assay 

30 A purified or partially purified TAOJIK is diluted in a suitable reaction 

buffer, e.g., 50 mM Hepes, pH 7.5, containing magnesium chloride or manganese chloride 
(1-20 mM) and a peptide or polypeptide substrate, such as myelin basic protein or casein 
(1-10 /xg/ml). The final concentration of the kinase is 1-20 nM. The enzyme reaction is 
conducted in microtiter plates to facilitate optimization of reaction conditions by 
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increasing assay throughput. A 96-well microtiter plate is employed using a final volume 
30-100 ill The reaction is initiated by the addition of 33 P-gamma-ATP (0.5 /iCi/ml) and 
incubated for 0.5 to 3 hours at room temperature. Negative controls are provided by the 
addition of EDTA, which chelates the divalent cation (Mg2 + or Mn 2+ ) required for 
5 enzymatic activity. Following the incubation, the enzyme reaction is quenched using 
EDTA. Samples of the reaction are transferred to a 96-well glass fiber filter plate 
(MultiScreen, Millipore). The filters are subsequently washed with phosphate-buffered 
saline, dilute phosphoric acid (0.5%) or other suitable medium to remove excess 
radiolabeled ATP. Scintillation cocktail is added to the filter plate and the incorporated 
10 radioactivity is quantitated by scintillation counting (Wallac/Perkin Elmer). Activity is 
defined by the amount of radioactivity detected following subtraction of the negative 
control reaction value (EDTA quench). 

VII. Expression analysis 
15 All cell lines used in the following experiments are NCI (National Cancer Institute) 

lines, and are available from ATCC (American Type Culture Collection, Manassas, VA 
201 10-2209). Normal and tumor tissues were obtained from Impath, UC Davis, Clontech, 
Stratagene, Ardais, Genome Collaborative, and Ambion. 

TAQMAN ANALYSIS. TaqMan analysis was used to assess expression levels of 
20 the disclosed genes in various samples. 

RNA was extracted from each tissue sample using Qiagen (Valencia, CA) RNeasy 
kits, following manufacturer's protocols, to a final concentration of SOng/pl. Single 
stranded cDNA was then synthesized by reverse transcribing the RNA samples using 
random hexamers and 500ng of total RNA per reaction, following protocol 4304965 of 
25 Applied Biosystems (Foster City, CA). 

Primers for expression analysis using TaqMan assay (Applied Biosystems, Foster 
City, CA) were prepared according to the TaqMan protocols, and the following criteria: a) 
primer pairs were designed to span introns to eliminate genomic contamination, and b) 
each primer pair produced only one product. Expression analysis was performed using a 
30 7900HT instrument. 

Taqman reactions were carried out following manufacturer's protocols, in 25 pi 
total volume for 96-well plates and 10 pi total volume for 384-well plates, using 300nM 
primer and 250 nM probe, and approximately 25ng of cDNA. The standard curve for 
result analysis was prepared using a universal pool of human cDNA samples, which is a 
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mixture of cDNAs from a wide variety of tissues so that the; chance that a target will be 
present in appreciable amounts is good. The raw data were normalized using 18S rRNA 
(universally expressed in all tissues and cells). 

For each expression analysis, tumor tissue samples were compared with matched 
5 normal tissues from the same patient. A gene was considered overexpressed in a tumor 
v when the level of expression of the gene was 2 fold or higher in the tumor compared with 

its matched normal sample. In cases where normal tissue was not available, a universal 
,;./,• pool of cDNA samples was used instead. In these cases, a gene was considered 

overexpressed in a tumor sample when the difference of expression levels between a 
10 tumor sample and the average of all normal samples from the same tissue type was greater 
than 2 times the standard deviation of all normal samples (i.e., Tumor - average(all normal 
samples) > 2 x STDEV(all normal samples) ). 

Results are shown in Table 2. Number of pairs of tumor samples and matched 
normal tissue from the same patient are shown for each tumor type. Percentage of the 
15 samples with at least two-fold overexpression for each tumor type is provided. A 

modulator identified by an assay described herein can be further validated for therapeutic 
effect by administration to a tumor in which the gene is overexpressed. A decrease in 
tumor growth confirms therapeutic utility of the modulator. Prior to treating a patient with 
the modulator, the likelihood: that the patient w ill respond to treatment can be diagnosed 
20 by obtaining a tumor sample from the patient, and assaying for expression of the gene 
targeted by the modulator. The expression data for the gene(s) can also be used as a 
diagnostic marker for disease progression. The assay can be performed by expression 
analysis as described above, by antibody directed to the gene target, or by any other 
available detection method. 
25- . 
Table 2 
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4759207 


1 


0% 


19 


12% 


33 


0% 


8 


21% 


24 


0% 


21 


8% 


12 


5% 


19 


8% 


12 


33% 


3 


7705559 


5 


5% 


19 


15% 


33 


13% 


8 


21% 


24 


5% 


20 


8% 


12 


5%. 


19 


25% 


12 


0% 


3 


7243102 


10 


5% 


19 


12% 


33 


13% 


8 


4% 


24 


5% 


20 


9% 


12 


16% 


19 


17% 


12 


33% 
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IMMUNOHISTOCHEMICAL ANALYSIS. Immunohistochemistry was used 
to localize TAOJIK protein in human tissue sections according to known methods 
(Thomas Boenisch, ed. (2001) Handbook, Immunochemical Staining Methods, 3 rd Edition, 
Dako Corporation, Carpinteria, CA). Antibody to TAOJIK GI#7243103 (SEQ ID NO:18) 
5 (rat TAOl, BD Biosciences; San Diego, CA ) was used for immunohistochemistry against 
tissue arrays containing 20 normal and 19 tumor tissues. Tissue sections were pre-treated 
with heat antigen retrieval in citrate buffer and the antibody was used at 20 ug/ml. In 
normal tissues, expression was observed in smooth muscle, breast myoepithelial cells and 
kidney glomeruli. In tumor tissues, there was overexpression in stomach stromal 
10 sarcomas, breast, endometrial, and bladder cancers. 

Vm. TAOJIK RNAi 

RNA interference experiments were carried out to knock down expression of 
TAOJIKs using small interfering RNAs (siRNA, Elbashir et al, supra). These 

15 experiments were performed against TAOJIKs GI#s 4759207 (SEQ ID NO:l), 7705559 
(SEQ ID NO:5), and 20559660 (SEQ ED NO: 12). For each experiment, three different 
siRNAs (21mer, double stranded RNA oligos with 2 base 3' overhangs, obtained from 
Genset Oligos/Proligo France SAS, France; Dharmacon Research Inc., Lafayette, CO) 
were transfected into cell lines (available from American Type Culture Collection 

20 (ATCC), Manassas, VA) at 100 nM using oligofectamine (Invitrogen). A mock 

transfection reagent without siRNA was included with each experiment. Cells were 
incubated for 3 days at 37 degrees. At the end of each experiment, some wells of cells 
were harvested for protein extracts and used in western analysis and parallel cells were put 
through a BrdU ELISA to measure cell proliferation. 

25 A. Effect of siRNA treatment of TAOJIKs in cells on beta-catenin. A549 lung 

cancer cells were grown in a 16 well slide (Nalge Nunc International; Rochester, NY). 
Cells were fixed with 4% paraformaldehyde and labeled with B-catenin antibody (Cell 
Signaling), and rhodamine-conjugated phalloidin (Cytoskeleton Inc.; Denver, CO) for 
actin staining. 

30 Results: 

B-catenin localization to the plasma membrane was decreased in cells transfected 
with siRNAi against all TAOJIKs. In addition, no changes in cell-cell interactions were 
observed. 
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B. Effect of siRNA treatment of TAOJIKs on cell proliferation in cancer cell 
lines* Cell lines A549 (lung cancer line), MDA-MB-23 IT (breast cancer line), SW480, 
SW620, HCT1 16 (colon cancer cell lines) were incubated for 3 days with lOOnM duplex 
siRNA.oligonucleotides, as described above. siRNA to luciferase and cyclin Dl were 

5 used as negative and positive controls, respectively. Cells in triplicate were incubated 
with 10% alamarBlue reagent (Biosource International; Camarillo, CA) according to 
standard protocol, to measure metabolism. Cells in triplicate were incubated with BrdU 
labeling reagent (Roche; Mannheim, Germany) according to standard protocol, to measure 
DNA synthesis in S phase cells. 

10 Results: 

A549 and MDA-MB-23 IT cells treated with siRNA against SEQ ID NOs: 1 , 5, and 
12 showed approximately a 50% reduction in alamarBlue or BrdU values compared to 
mock transfection. Furthermore, SW480, SW620 and HCT116 cells treated with siRNA 
against SEQ ID NO: 12 showed approximately 50% decrease in Alamar blue or BrdU 

15 values compared to mock transfection. Thus, siRNA treatment of TAOJIKs reduces cell 
proliferation in cancer cell lines. 

C. Western blot analysis of effect of siRNA treatment of TAOJIKs on beta- 
catenin, actin, and E-cadherin expression in A549, SW620, and HCT116 cancer cell 
lines. Triplicate wells from the above experiments A and B were lysed in RIPA buffer 

20 (Boston BioProducts, Inc.; Ashland, MA) with protease inhibitors (Roche) and 

phosphatase inhibitors. Protein concentrations of cell lysates were determined by the BCA 
method (Pierce; Rockford, EL). Samples were run on a 4-12% gradient gel (Invitrogen; 
Carlsbad, CA), transferred to PVDF membrane (Bedford, MA), and probed with 
antibodies against SEQ ID NO:18 (rat TAOl, BD Biosciences; San Diego, CA ); B- 

25 catenin (Cell Signaling Technology; Beverly, MA); actin (Accurate Chemical and 
Scientific Corporation; Westbury, NY); glyceraldehyde^3-phosphate dehydrogenase 
(Advanced Immuno Chemical, Inc.; Long Beach, CA); and E-cadherin (BD Biosciences). 
Results: 

In A549 cells, siRNA treatment of TAOJIKs SEQ ID NO:l and SEQ ID NO:5 
30 decreased beta-catenin, E-cadherin and actin expression in A549 cells. . KIAA1361 

siRNA treatment of TAOJIK SEQ ID NO: 12 decreased expression of SEQ ID NO: 12 and 
E-cadherin. 
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Conclusion: 

Taken together, the RNAi and western blot experiments, in addition to reducing 
cell proliferation, suggest a link between TAOJIKs and beta-catenin function. Since the 
C. elegaiis T17E9.1 is also implicated in the beta catenin pathway, the link between 
5 TAOJIKs and the beta-catenin pathway provides compelling evidence of evolutionary 
functional conservation between these orthologs. 
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WHAT IS CLAIMED IS: 

1. A method of identifying a candidate beta-catenin pathway modulating agent, said 
method comprising the steps of: 

5 (a) providing an assay system comprising a TAOJIK polypeptide or nucleic acid; 

(b) contacting the assay system with a test agent under conditions whereby, but for 
the presence of the test agent, the system provides a reference activity; and 

(c) detecting a test, agent-biased activity of the assay system, wherein a difference 
between the test agent-biased activity and the reference activity identifies the test agent as 

10 a candidate beta-catenin pathway modulating agent. 

2. The method of Claim 1 wherein the assay system comprises cultured cells that express 
the TAOJIK polypeptide. 

15 3. . The method of Claim 2 wherein the cultured cells additionally have defective beta- 
catenin function: 

4. The method of Claim 1 wherein the assay system includes a screening assay 
comprising a TAOJIK polypeptide, and the candidate test agent is a small molecule 

20 modulator. 

5. The method of Claim 4 wherein the assay is a kinase assay. 

6. The method of Claim 1 wherein the assay system is selected from the group consisting 
25 of an apoptosis assay system, a cell proliferation assay system, an angiogenesis assay 

system, and a hypoxic induction assay system. 

7. The method of Claim 1 wherein the assay system includes a binding assay comprising a 
TAOJIK polypeptide and the candidate test agent is an antibody. 

30 

8. The method of Claim 1 wherein the assay system includes an expression assay 
comprising a TAOJIK nucleic acid and the candidate test agent is a nucleic acid 
modulator. 

44 



0305l9O5A2_l_> 



WO 03/051905 PCT/US02/39742 

9. The method of claim 8 wherein the nucleic acid modulator is an antisense oligomer. 

10. The method of Claim 8 wherein the nucleic acid modulator is a PMO. 



5 11. The method of Claim 1 additionally comprising: 

(d) administering the candidate beta-catenin pathway modulating agent identified 
in (c) to a model system comprising cells defective in beta-catenin function and, detecting 
a phenotypic change in the model system that indicates that the beta-catenin function is 
restored. 

10 

12. The method of Claim 1 1 wherein the model system is a mouse model with defective 
beta-catenin function. 

13. A method for modulating a beta-catenin pathway of a cell comprising contacting a 

15 cell defective in beta-catenin function with a candidate modulator that specifically binds to 
a TAOJIK polypeptide comprising an amino acid sequence selected from group consisting 
of SEQ ID NOs:16, 17, and 18 whereby beta-catenin function is restored. 

14. The method of claim 13 wherein the candidate modulator is administered to a 

20 vertebrate animal predetermined to have a disease or disorder resulting from a defect in 
beta-catenin function. 

15. The method of Claim 13 wherein the candidate modulator is selected from the group 
consisting of an antibody and a small molecule. 

25 

16. The method of Claim 1, comprising the additional steps of: 

(d) providing a secondary assay system comprising cultured cells or a non-human 
animal expressing TAOJIK , 

(e) contacting the secondary assay system with the test agent of (b) or an agent 

30 derived therefrom under conditions whereby, but for the presence of the test agent or agent 
derived therefrom, the system provides a reference activity; and 

(f) detecting an agent-biased activity of the second assay system, 
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wherein a difference between the agent-biased activity and the reference activity of 
the second assay system confirms the test agent or agent derived therefrom as a candidate 
beta-catenin pathway modulating agent, 

and wherein the second assay detects an agent-biased change in the beta-catenin 
5 pathway. 

17. Hie method of Claim 16 wherein the secondary assay system comprises cultured 
cells. 

10 18. The method of Claim 16 wherein the secondary assay system comprises a non-human 
animal. 

19. The method of Claim 18 wherein the non-human animal mis-expresses a beta-catenin 
pathway gene. 

15 

20. A method of modulating beta-catenin pathway in a mammalian cell comprising 
contacting the cell with an agent that specifically binds a TAOJIK polypeptide or nucleic 
acid. 

20 21. The method of Claim 20 wherein the agent is administered to a mammalian animal 
predetermined to have a pathology associated with the beta-catenin pathway. 

22. The method of Claim 20 wherein the agent is a small molecule modulator, a nucleic 
acid modulator, or an antibody. 

25 

23. A method for diagnosing a disease in a patient comprising: 

(a) obtaining a biological sample from the patient; 

(b) contacting the sample with a probe for TAOJIK expression; 

(c) comparing results from step (b) with a control; 

30 (d) determining whether step (c) indicates a likelihood of disease. 

24. The method of claim 23 wherein said disease is cancer. 
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25. The method according to claim 24, wherein said cancer is a cancer as shown in Table 
2 as having >25% expression level. 

26. The method of claim 24, wherein said cancer is stomach stromal sarcoma, breast, 
5 endometrial, or bladder cancer. 
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<210> 1 

<211> 4242 

<212> DNA 

<213> Homo sapiens 

<400> 1 

agaatttcaa atatcaggtt caggcccctg cgtgcaccag tatccggggt tcattccccg 60 

ggcgttcaaa tatcggattc agtctccatc ccgttcagat attcggggtt cagaccccac 120 

aatcagaaat ccggaattcg gcagctgtcg ccctcgacga gggggaggac tggaccgcga 180 

ggtcagatta ggttgtcacc ccctcccctc caggggaggc ttcccgggcc cgeccctcag 240 

gaagggcgaa agccgaggaa gaggtggcaa ggggaaaggt ctccttgccc ctetccctgc 300 

ttggcagagc cgctggagga ccccaggcgg aagcggaggc gctggggcac catagtgacc 3 60 

cctaccaggc caggccccac tctcagggcc cccaggggcc accatgccag ctgggggccg 420 

ggccgggagc ctgaaggacc cagatgtggc tgagctcttc ttcaaggatg acccagaaaa 480 

gctcttctct gacctccggg aaattggcca tggcagcttt ggagccgtat actttgcccg 540 
ggatgtccgg aatagtgagg tggtggccat caagaagatg tcctacagtg ggaagcagtc 
caatgagaaa tggcaagaca tcatcaagga ggtgcggttc ttacagaagc tccggcatcc 

caacaccatt cagtaccggg gctgttacct gagggagcac acggcttggc tggtaatgga 720 

gtattgcctg ggctcagctt ctgaccttct agaagtgcac aagaaacccc ttcaggaggt 780 

agagatcgca gctgtgaccc acggggcgct tcagggcctg gcatatctgc actcccacaa 840 

catgatccat agggatgtga aggctggaaa catcctgctg tcagagccag ggttagtgaa 900 

gctaggggac tttggttctg cgtccatcat ggoacctgcc aactccttcg tgggcacccc 960 

atactggatg gcacccgagg tgatcctggc catggatgag gggcagtacg atggcaaagt 1020 

ggacgtctgg. tccttgggga taacctgcat cgagctggct gaacggaaac caccgctctt 1080 

taacatgaat . gcgatgagtg ccttatacca cattgcacag aacgaatccc ccgtgctcca 1140 

gtcaggacac tggtctgagt acttccggaa ttttgtcgac tcctgtcttc agaaaatccc 1200 
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tcaagacaga ccaacctcag 
acccacagtc atcatggacc 
cctgcagtac cgcaagatga 
cgaggcccca gaggaggaag 
cagcctcgag agtagccact 
ctccgtcaac agcctagcag 
agaggaggag gaggaagaag 
gcacacagtc acctctcaea 
tgatgacccc taccagccag 
tcccacttcc accacctctt 
tgccaccatc cgaaccgcct 
fcgcgctgcgg gagcagctga 
gctggccctg gagtcacggc 
ggagcttgag gcgcagcggg 
ccaggccata ggtgagaagg 
gcacatcctt gggcagcaga 
ctacaaactt cgcaaggaac 
gcgggagaag gccgagtggc 
ggaggaagca gggctgctgc 
caagcgcaag atgttgctgg 
gaacaagaag cagacccaga 
cacgcgggag ctggagctgc 
ccgcctgcag caccagacgg 
agagttgcgg cagaagcatg 
ggagctgcag atcaagaagc 
ggctctgcga gcacacttgc 
gcggctcaag gaagagcaga 
catctcagag atgctcagct 
ccaggccctt cggcagcagc 
gatcaagatc cgcacagaga 
cgcgctgcgg cgggcactgc 
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aggttctcct 


gaagcaccgc 


tttgtgctcc 


gggagcggcc 


1260 


tgatccagag 


gaccaaggat 


gccgtgcggg 


agctggacaa 


1320 


agaagatcct 


gttccaagag 


gcacccaacg 


gccctggtgc 


1380 


aggaggccga 


gccctacatg 


caccgggccg 


ggactctgac 


1440 


cagtgcccag 


catgtccatc 


agcgcctcca 


gccagagcag 


1500 


atgcctcaga 


caacgaggaa 


gaggaggagg 


aggaggagga 


1560 


gccctgaagc 


ccgggagatg 


gccatgatgc 




1620 


gctccattat 


ccaccggctg 


ccgggctctg 


acaacctata 


1680 


agataacccc 


cagccctctc 


cagccgcctg 


cagccccagc 


1740 


ctgcccgccg 


ccgggcctac 


tgccgtaacc 


gagaccactt 


1800 


ccctggtcag 


ccgtcagatc 


caggagcatg 


agcaggactc 


1860 


gcggctataa 


gcggatgcga 


cgacagcacc 


agaagcagct 


1920 


tgaggggtga 


acgggaggag 


cacagtgcac 


ggctgcagcg 


1980 


ctggctttgg 


ggcagaggca 


gaaaagctgg 


cccggcggca 


2040 


aggcacgagc 


tgcccaggcc 


gaggagcgga 


agttccagca 


2100 


agaaggagct 


ggctgccctg 


ctggaggcac 


agaagcggac 


2160 


agctgaagga 


ggagctccag 


gagaacccca 


gcactcccaa 


2220 


tgctgcggca 


gaaggagcag 


ctccagcagt 


gccaggcgga 


2280 


ggcggcagcg 


ccagtacttt 


gagctgcagt 


gtcgccagta 


2340 


ctcggcacag 


cctggaccag 


gacctgctgc 


9rggaggacct 


2400 


aggacttgga 


gtgtgcactg 


ctgcttcggc 


agcacgaggc 


2460 


ggcagctcca 


ggccgtgcag 


cgcacgcggg 


ctgagctcac 


2520 


agctgggcaa 


ccagctggag 


tacaacaagc 


ggcgtgagca 


2580 


cggcccaggt 


tcgccagcag 


cccaagagcc 


tcaaatctaa 


2640 


agttccagga 


gacgtgtaag 


atccagactc 


ggcagtacaa 


2700 


tggagaccac 


gcccaaagct 


cagcacaaga 


gcctccttaa 


2760 


cccgcaagct 


ggcgatcttg 


gcggagcagt 


atgaccagtc 


2820 


cacaggcgct 


gcggcttgat 


gagacccagg 


aggcagagtt 


2880 


ttcaacagga 


gctggagctg 


ctcaacgctt 


accagagcaa 


2940 


gccagcacga 


gagggagctg 


cgggagctgg 


agcagagggt 


3000 


tggagcagcg 


ggtggaagag 


gagctgctgg 


ccctgcagac 


3060 
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aggacgctcc 


gagcgaatcc 


gcagtctgct 


tgagcggcag 


gcccgtgaga 


tcgaggcctt 


3120 


cgatgcggaa 


agcatgaggc 


tgggcttctc 


cagcatggct 


ctggggggca 


tcccggctga 


3180 


agctgctgcc 


cagggctatc 


ctgctccacc 


ccctgcccca 


gcctggccct 


cccgtcccgt 


3240 


tccccgttct 


ggggcacact 


ggagccatgg 


ccctcctcca 


ccaggcatgc 


cccctccagc 


3300 


ctggcgtcag 


ccgtctctgc 


tggctccccc 


aggcccccca 


aactggctgg 


ggccccccac 


3360 


acaaagtggg 


acaccccgtg 


gcggagccct 


gctgctgcta 


agaaacagcc 


cccagcccct 


3420 


gcggcgggca 


gcctcggggg gcagtggcag 


tgagaatgtg 


ggcccccctg 


ctgccgcggt 


3480 


gcccgggccc 


ctgagccgca 


gcaccagtgt 


cgcttcccac 


atcctcaatg 


gtfccttccca 


3540 


cttctattcc 


tgaggtgcag 


cggggaggag 


cagatgagct 


gggcagggca 


ggggtgggtg 


3600 


gagcctgacc 


ctggagggca 


ctgagctgga 


ggcccctgca 


agggtagggg 


acaagatgta 


3660 


ggctccagct 


cccctcagac 


ctcctcatct 


catgagcttc 


ttggggctgg 


ccagtggccc 


3720 


agggccagct 


tggcgataga 


fcgcctcaagg 


ctgcctggga 


gccccgcctc 


cctaccatgg 


3780 


tgccaggggt 


ctccctccgc 


cacctaggaa 


aggagggaga 


tgtgcgtgtc 


aaatattcat 


3840 


ctagtcccct 


gggggagggg 


aagggtgggt 


ctagacatac 


tatattcaga 


gaactatact 


3900 


accctcacag 


tgaggccctc 


agacctgcca 


cagggcagag 


caggtctggg 


gcctgaggca 


3960 


gggagaatga 


gaggccacct 


tactggcagg 


aaggatcagg 


atggggtcfcfc 


ggggtcagga 


4020 


fcgccfcgggfcc 


tcttcccgta 


actgtctgac 


^ ^ /"f 1 » t~*r /—+ 

gtCCuytgCC 
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tttttttttt 


taattgggat 


cagggctggg 


gcggggaaac 


aagggaagga 


ccttggaagg 


4140 


ggctgctccc 


aggcctgggg 


ggcagtcgtg 


ggagcccctc 


tcagctgtgg 


ggctggcaca 


4200 


gagccccagg 


caagctttta 


ataaactgtt 


ggttattcta 


ac 




4242 



<210> 2 

<211> 3069 

<212> DNA ' 

<213> Homo sapiens 

<400> 2 



gagaccggga 


cgagaccggg 


gctgtggtgc 


ggagagaggc 


tgagacggag 


aagaggagag 


60 


gcagagaggg 


cgcggggacc 


gtcagcagca 


ccttagctac 


aatcgttcag 


ctattctcgg 


120 


aagagagaag 


ggagagggag 


gaggccgggg 


cgggagtggg 


ggctgtcacc 


ctcggacccc 


180 


ggcgtgagag 


gggccgtgcg 


gccggacgtc 


ctcggggtgg 


gcccccagtc 


ggtggecgaa 


240 


gacctacagc 


tcaggcccct 


gggtcccaaa 


tttccaggct 


ttgcccctcc 


tcctttctca 


300 


gatacccggg 


taacagtcct 


catagtccag 


atatccggga 


ctcgggtccc 


aacctctcta 


360 


aacctgggtc 


tctgtttcat 


agaatttcaa 


atatcaggtt 


caggcccctg 


cgtgcaccag 


420 


tatccggggt 


tcattccccg 


ggcgttcaaa 


tatcggattc 


agtctccatc 


ccgttcagat 


480 
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attcggggtt 


cagaccccac 


aatcagaaat 


ccggaattcg 


gcagctgtcg 


ccctcgacga 


540 


gggggaggac 


tggaccgcga 


ggtcagatta 


ggttgtcacc 


ccctcccctc 


caggggaggc 


600 


ttcccgggcc 


cgcccctcag 


gaagggcgaa 


agccgaggaa 


gaggtggcaa 


ggggaaaggt 


660 


ctccttgccc 


ctctccctgc 


ttggcagagc 


cgctggagga 


ccccaggcgg 


aagcggaggc 


720 


gctggggcac 


catagtgacc 


cctaccaggc 


caggccccac 


tctcagggcc 


cccaggggcc 


780 


accatgccag 


ctgggggccg 


ggccgggagc 


ctgaaggacc 


cagatgtggc 


tgagctcttc 


840 


ttcaaggatg 


acccagaaaa 


gctcttctct 


gacctccggg 


aaattggcca 


tggcagcttt 


900 


ggagccgtat 


acttfcgcccg 


ggatgtccgg 


aatagtgagg 


tggtggccat 


caagaagatg 


960 


tcctacagtg 


ggaagcagtc 


caatgagaaa 


tggcaagaca 


tcatcaagga 


ggtgcggttc 


1020 


ttacagaagc 


tccggcatcc 


caacaccatt 


cagtaccggg 


gctgttacct 


gagggagcac 


1080 


acggcttggc 


tggtaatgga 


gtattgcctg 


ggctcagctt 


ctgaccttct 


agaagtgcac 


1140 


aagaaacc cc 


ttcaggaggt 


agagatcgca 


gctgtgaccc 


acggggcgct 


tcagggcctg 


1200 


gcatatctgc 


actcccacaa 


catgatccat 


agggatgtga 


aggctggaaa 


catcctgctg 


1260 


tcagagccag 


ggttagtgaa 


gctaggggac 


tttggttctg 


cgtccatcat 


ggcacctgcc 


1320 


aactccttcg 


tgggcacccc 


atactggatg 


gcacccgagg 


tgatcctggc 


catggatgag 


1380 


gggcagtacg 


atggcaaagt 


ggacgtctgg 


tccttgggga 


taacctgcat 


cgagctggct 


1440 


gaacggaaac 


caccgctctt 


taacatgaat 


gcgatgagtg 


ccttatacca 


cattgcacag 


1500 


aacgaatccc 


ccgtgctcca 


gtcaggacac 


tggtctgagt 


acttccggaa 


ttttgtcgac 


1560 


fccctgtcttc 


agaaaatccc 


tcaagacaga 


ccaacctcag 


aggttctcct 


gaagcaccgc 


1620 


tttgtgctcc 


gggagcggcc 


acccacagtc 


atcatggacc 


tgatccagag 


gaccaaggat 


1680 


gccgtgcggg 


agctggacaa 


cctgcagtac 


cgcaagatga 


agaagatcct 


gttccaagag 


1740 


gcacccaacg 


gccctggtgc 


cgaggcccca 


gaggaggaag 


aggaggccga 


gccctacatg 


1800 


caccgggccg 


ggactcfcgac 


cagcctcgag 


agtagccact 


cagtgcccag 


catgtccatc 


1860 


agcgcctcca 


gccagagcag 


ctccgtcaac 


agcctagcag 


atgcctcaga 


caacgaggaa 


1920 


gaggaggagg 


aggaggagga 


agaggaggag 


gaggaagaag 


gccctgaagc 


ccgggagatg 


1980 


gccatgatgc 


aggaggggga 


gcacacagtc 


acctctcaca 


gctccattat 


ccaccggctg 


2040 


ccgggcfcctg 


acaacctata 


tgatgacccc 


taccagccag 


agataacccc 


cagccctctc 


2100 


cagccgcctg 


cagccccagc 


tcccacttcc 


accacctctt 


ctgcccgccg 


ccgggcctac 


2160 


tgccgtaacc 


gagaccactt 


tgccaccatc 


cgaaccgcct 


ccctggtcag 


ccgtcagatc 


2220 


caggagcatg 


agcaggactc 


tgcgctgcgg 


gagcagctga 


gcggctataa 


gcggatgcga 


2280 


cgacagcacc 


agaagcagct 


gctggccctg 


gagtcacggc 


tgaggggtga 


acgggaggag 


2340 
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cacagtgcac 


ggctgcagcg 


ggagctfcgag 


gcgcagcggg 


ctggctttgg 


ggcagaggca 


2400 




gaaaagctgg 


cccggcggca 


ccaggccata 


ggtgagaagg 


aggcacgagc 


tgcccaggcc 


2460 




gaggagcgga 


agttccagca 


gcacatcctt 


gggcagcaga 


agaaggagct 


ggctgccctg 


2520 




ctggaggcac 


agaagcggac 


ctacaaactt 


cgcaaggaac 


agctgaagga 


ggagctccag 


2580 




gagaacccca 


gcactcccaa 


gcgggagaag 


gccgagtggc 


tgctgcggca 


gaaggagcag 


2640 




ctccagcagt 


gccaggcgga 


ggaggaagca 


gggctgctgc 


ggcggcagcg 


ccagtacttt 


2700 


"> 


gagctgcagt 


gtcgccagta 


caagcgcaag 


atgttgctgg 


ctcggcacag 


cctggaccag 


2760 




gacctgctgc 


gggaggacct 


gaacaagaag 


cagacccaga 


aggacttgga 


gtgtgcactg 


2820 




ctgcttcggc 


agcacgaggc 


cacgcgggag 


ctggagctgc 


ggcagctcca 


ggccgtgcag 


2880 




CgCaCyCyyy 
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oyL. Lyyy uu 




2940 


-. 


tacaacaagc 


ggcgtgagca 


agagttgcgg 


cagaagcatg 


cggcccaggt 


tcgccagcag 


3000 




cccaagagcc 


tcaaaaaaaa 


gacaaacaca 


aataaaatat 


ctgagcggaa 


aaaaaaaaaa 


3060 




aaaaaaaaa 
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<210> 3 

<211> 3254 

<212> DNA 

<213> Homo sapiens 














<400> 3 
ctggaggacc 


ccaggcggaa 


gcggaggcgc 


tggggcacca 


tagtgacccc 


taccaggcca 


60 




ggccccactc 


tcagggcccc 


caggggccac 


catgccagct 


gggggccggg 


ccgggagcct 


120 




gaaggac c c a 


gatgtggctg 


agctcttctt 


caaggatgac 


ccagaaaagc 


tcttctctga 


180 




cctccgggaa 


attggccatg 


gcagctttgg 


agccgtatac 


tttgcccggg 


atgtccggaa 


240 




tagtgaggtg 


gtggccatca 


agaagatgtc 


ctacagtggg 


aagcagtcca atgagaaatg 


300 


'v& . . . 


gcaagacatc 


atcaaggagg 


tgcggttctt 


acagaagctc 


cggcatccca 


acaccattca 


360 




gtaccggggc 


tgttacctga 


gggagcacac 


ggcttggctg 


gtaatggagt 


attgcctggg 


420 




ctcagcttct 


gaccttctag 


aagtgcacaa 


gaaacccctt 


caggaggtag 


agatcgcagc 


480 




tgtgacccac 


ggggcgcttc 


agggcctggc 


atatctgcac 


tcccacaaca 


tgatccatag 


540 




ggatgtgaag 


gctggaaaca 


tcctgctgtc 


agagccaggg 


ttagtgaagc 


taggggactt 


600 




tggttctgcg 


tccatcatgg 


cacctgccaa 


ctccttcgtg 


ggcaecccat 


actggatggc 


660 


-£> 


acccgaggtg 


atcctggcca 


fcggatgaggg 


gcagtacgat 


ggcaaagtgg 


acg.tctggtc 


720 




cttggggata 


acctgcatcg 


agctggcfcga 


acggaaacca 


ccgctcttta 


acatgaatgc 


780 




gatgagtgcc 


ttataccaca 


ttgcacagaa 


cgaatccccc 


gtgctccagt 


caggacactg 


840 
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gtctgagtac 
aacctcagag 
catggacctg 
caagatgaag 
ggaggaagag 
tagccactca 
cctagcagat 
ggaagaaggc 
ctctcacagc 
ccagccagag 
cacctcttcc 
aaccgcctcc 
gc age t gage 
gtcacggctg 
geageggget 
tgagaaggag 
gcagcagaag 
caaggaacag 
cgagtggctg 
getgetgegg 
gttgctggct 
gacccagaag 
ggagctgegg 
ecagaeggag 
gaagcatgcg 
caagaagcag 
acacttgetg 
agagcagacc 
gctcagctca 
gcagcagctt 
cacagagagc 



ttceggaatt 
gttctcctga 
atccagagga 
aagatcctgt 
gaggecgage 
gtgcccagca 
gcctcagaca 
cctgaagccc 
tccattafccc 
ataaccccca 
gcccgccgcc 
ctggtcagcc 
ggctataagc 
aggggtgaac 
ggctttgggg 
gcacgagctg 
aaggagctgg 
ctgaaggagg 
ctgeggcaga 
cggcagcgcc 
cggcacagcc 
gacttggagt 
cagctccagg 
ctgggcaacc 
gcccaggttc 
ttccaggaga 
gagaccacgc 
cgcaagctgg 
caggcgctgc 
caacaggagc 
c age ac gaga 



ttgtcgactc 
agcaccgctt 
ecaaggatge 
tccaagaggc 
cctacatgca 
tgtccatcag 
acgaggaaga 
gggagatggc 
accggctgcc 
gccctctcca 
gggcctactg 
gtcagatcca 
ggatgcgacg 
gggaggagca 
cagaggcaga 
cccaggccga 
ctgccctgct 
agctccagga 
aggagcagct 
agtactttga 
tggaccagga 
gtgeactget 
ccgtgcagcg 
agctggagta 
gc cage age c 
cgtgtaagat 
ccaaagctca 
cgatcttggc 
ggcttgatga 
tggagctget 
gggagctgcg 



ctgtcttcag 
tgtgctccgg 
cgtgcgggag 
acccaacggc 
ccgggccggg 
cgcctccagc 
ggaggaggag 
catgatgeag 
gggctctgac 
gccgcctgca 
ccgtaaccga 
ggagcatgag 
acagcaccag 
cagtgeaegg 
aaagctggcc 
ggagcggaag 
ggaggcacag 
gaaccccagc 
ccagcagtgc 
gctgcagtgt 
cctgctgcgg 
getteggcag 
cacgcgggct 
caacaagegg 
caagagcctc 
ccagactcgg 
gcacaagagc 
ggagcagtat 
gacccaggag 
caacgcttac 
ggagctggag 



aaaatccctc 
gagcggccac 
ctggacaacc 
cctggtgccg 
actctgacca 
cagagcagct 
gaggaggaag 
gagggggagc 
aacctatatg 
gccccagctc 
gaccactttg 
caggactctg 
aagcagctgc 
ctgcagcggg 
cggcggcacc 
ttccagcagc 
aageggaect 
actcccaagc 
caggeggagg 
cgccagtaca 
gaggacctga 
cacgaggcca 
gagctcaccc 
cgtgagcaag 
aaatctaagg 
cagtacaagg 
ctccttaagc 
gaccagtcca 
gcagagttcc 
cagagcaaga 
cagagggtcg 
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aagacagacc 900 

ccacagtcat 960 

tgcagtaccg 1020 

aggccccaga 1080 

gectcgagag 1140 

ccgtcaacag 1200 

aggaggagga 1260 

acacagtcac 1320 

atgaccccta 1380 

ccacttccac 1440 

ccaccatccg 1500 

cgctgcggga 1560 

tggccctgga 1620 

agcttgaggc 1680 

aggecatagg 1740 

acatccttgg 1800 

acaaacttcg 1860 

gggagaaggc 19 20 

aggaagcagg 19 8 0 

agegcaagat 2 040 

acaagaagca 2100 

cgegggaget 2160 

gcctgcagca 2220 

agttgcggca 22 80 

agetgeagat 2340 

c tc tgegage 2400 

ggctcaagga 2460 

tctcagagat 2520 

aggcccttcg 2 580 

tcaagatccg 2640 

cgctgcggcg 27 00 



WO 03/051905 

ggcactgctg gagcagcggg tggaagagga 
gcgaatccgc agtctgcttg agcggcaggc 
catgaggctg ggcttctcca gcatggctct 
gggctatcct gctccacccc ctgccccagc 
ggcacactgg agccatggcc ctcctccacc 
gtctctgctg gctcccccag gccccccaaa 
accccgtggc ggagccctgc tgctgctaag 
ctcggggggc agtggcagtg agaatgtggg 
gagccgcagc accagtgtcg cttcccacat 
aggtgcagcg ggga 

<210> 4 

<211> 4971 

<212> DNA 

<213> Homo sapiens 

<400> 4 

aattcggeac gagctgagac ggagaagagg 
age acq t tag etacaategt tcagctattc 
ggggegggag tgggggctgt caccctcgga 
cgtcctcggg . gtgggccccc agtcggtggc 
caaatttcca ggctttgc.ee ctcctccttt 
ccagatatcc gggactcggg tcccaacctc 
tcaaatatca ggttcaggcc cctgcgtgca 
tcaaatatcg gattcagtct ccatcccgtt 
gaaatccgga atteggcage tgtcgccctc 
gattaggttg tcaccccctc ccctccaggg 
gcgaaagccg aggaagaggt ggcaagggga 
agagcegctg gaggacccca ggcggaagcg 
caggccaggc c.ccactctca gggcccccag 
ggagcctgaa ggacccagat gtggctgagc 
tctctgacct cegggaaatt ggccatggca 
teeggaatag tgaggtggtg gecatcaaga 
agaaatggca agacatcatc aaggaggtgc 
ccattcagta ccggggctgt tacctgaggg 



PCT/US02/39742 

gctgctggcc ctgcagacag gacgctccga 27 60 
ccgtgagatc gaggectteg atgcggaaag 2820 
ggggggcatc ccggctgaag ctgctgccca 2880 
ctggccctcc cgtcccgttc cccgttctgg 2940 
aggcatgccc cctccagcct ggcgtcagcc 3000 
ctggctgggg ccccccacac aaagtgggac 3 060 
aaacagcccc cagcccctgc ggegggcage 3120 
cccccctgct gccgcggtgc ccgggcccct 3180 
cctcaatggt tcttcccact tctattcctg 3240 

3254 



agaggcagag agggegeggg gaccgtcagc 60 

teggaagaga gaagggagag ggaggaggee 120 

ccccggcgtg agaggggecg tgcggccgga 180 

cgaagaccta cagctcaggc cc ctgggtcc 240 

ctcagatacc egggtaacag tcctcatagt 300 

tctaaacctg ggtctetgtt tcatagattt 3 60 

ccagtatccg gggttcattc cccgggcgtt 420 

cagatattcg gggttcagac cccacaatca 480 

gacgaggggg aggactggac cgegaggtea 540 

gaggcttccc gggcccgccc cteaggaagg 600 

aaggtctcct tgcccctctc cctgcttggc 660 

gaggegctgg ggcaccatag tgacccctac 720 

gggccaccat gccagctggg ggccgggccg 780 

tcttcttcaa ggatgaccca gaaaagctct 840 

gctttggagc cgtatacttt gecegggatg 900 

agatgtccta cagtgggaag cagtccaatg 960 

ggttcttaca gaagctcegg catcccaaca 1020 

agcacacggc ttggctggta atggagtatt 1080 



,03051905A2J_> 
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gcctgggctc 


agcttctgac 


cttctagaag 


tgcacaagaa 


accccttcag 


gaggtagaga 


1140 


tcgcagctgt 


gacccacggg 


gcgcttcagg 


gcctggcata 


tctgcactcc 


cacaacatga 


1200 


tccataggga 


tgtgaaggct 


ggaaacatcc 


tgctgtcaga 


gccagggtta 


gtgaagctag 


12 60 


gggactttgg 


ttctgcgtcc 


atcatggcac 


ctgccaactc 


cttcgtgggc 


accccatact 


1320 


ggatggcacc 


cgaggtgatc 


ctggccatgg 


atgaggggca 


gtacgatggc 


aaagtggacg 


1380 


tctggtcctfc 


ggggataacc 


tgcatcgagc 


tggctgaacg 


gaaaccaccg 


ctctttaaca 


1440 


tgaatgcgat 


gagtgcctta 


taccacattg 


cacagaacga 


atcccccgtg 


ctccagtcag 


1500 


gacactggtc 


tgagtacttc 


cggaattttg 


tcgactcctg 


tcttcagaaa 


atccctcaag 


1560 


acagaccaac 


ctcagaggtt 


ctcctgaagc 


accgctttgt 


gctccgggag 


cggccaccca 


1620 


cagtcatcat 


ggacctgatc 


cagaggacca 


aggatgccgt 


gcgggagctg gacaacctgc 


1680 


agtaccgcaa 


gatgaagaag 


atcctgttcc 


aagaggcacc 


caacggccct 


ggtgccgagg 


1740 


ccccagagga 


ggaagaggag 


gccgagccct 


acatgcaccg 


ggccgggact 


ctgaccagcc 


1800 


tcgagagtag ccactcagtg cccagcatgt 


ccatcagcgc 


ctccagccag 


age age tc eg 


1860 


tcaacagcct 


agcagatgcc 


tcagacaacg 


aggaagagga 


ggaggaggag 


gaggaagagg 


1920 


aggaggagga 


agaaggccct 


gaagcccggg 


agatggccat 


gatgcaggag 


ggggagcaca 


1980 


cagtcacctc 


tcacagctcc 


attatccacc 


ggctgccggg 


ctctgacaac 


ctatatgatg 


2040 


acccctacca 


gccagagata 


acccccagec 


ctctccagcc 


gcctgcagcc 


ccagctccca 


2100 


cttccaccac 


ctcttccgcc 


cgccgccggg 


cctactgccg 


taaccgagac 


cactttgcca 


2160 


ccatccgaac 


cgcctccctg gtcagccgtc 


agatccagga 


gcatgagcag 


gactctgcgc 


2220 


fcgcgggagca 


gctgagcggc 


tataagcgga 


tgcgacgaca 


gcaccagaag 


cagctgetgg 


2280 


ccctggagtc 


acggctgagg ggtgaacggg 


aggagcacag 


tgcacggctg 


cagegggage 


2340 


ttgaggcgca 


gcgggctggc 


tttggggcag 


aggcagaaaa 


gctggcccgg 


cggcaccagg 


2400 


ccataggtga 


gaaggaggca 


cgagctgccc 


ajggccgagga 


gcggaagttc 


cagcagcaca 


2460 


tccttgggca 


gcagaagaag 


gagctggctg 


ccctgctgga 


ggcacagaag 


cggacctaca 


2520 


aacttcgcaa 


ggaacagctg aaggaggagc 


tccaggagaa 


ccccagcact 


cccaagcggg 


2580 


agaaggccga 


gtggctgctg 


cggcagaagg 


agcagctcca 


gcagtgccag 


geggaggagg 


2640 


aagcagggct 


gctgcggcgg 


cagcgccagt 


acfcttgagct 


gcagtgtcgc 


cagtacaagc 


2700 


gcaagatgtt 


gctggctcgg cacagcctgg accaggacct 


gctgcgggag 


gacctgaaca 


2760 


agaagcagac 


ccagaaggac 


ttggagtgtg 


cactgctgct 


tcggcagcac 


gaggccacgc 


2820 


gggagctgga 


gctgcggcag 


ctccaggccg 


tgcagcgcac 


gcgggctgag 


ctcacccgcc 


2880 


tgcagcacca 


gacggagctg ggcaaccagc 


tggagtacaa 


caagcggcgt 


gagcaagagt 


2940 
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tgcggcagaa gcatgcggcc caggttcgcc 


agcagcccaa gagcctcaaa 


gtacgtgcag 


3000 


gccagcgccc 


cccgggcctt 


ccactcccea 


ttcctggggc 


tctgggccca . 


cccaacacag 


3060 


gcacccctat 


agaacagcag 


ccetgctcac 


ctggccagga 


ggcagtcctg 


gaccaaagaa 


3120 


tgcttggcga 


ggaggaggaa 


gcagttggag 


agagaaggat 


tctgggaaag 


gaaggggcca 


3180 


ctttggagcc 


caagcagcag 


aggattctgg 


gggaagaatc 


aggagcccct 


agtcccagtc 


3240 


cacaaaaaca 


tgggagcctg 


gttgatgagg 


aagtttgggg 


tctgcctgag 


gagatagagg 


3300 


v agcttagggt 


gccctccctt 


gtaccccagg 


agaggagcat 


tgttggccag 


gaggaggctg 


3360 


ggacgtggag 


cttgtggggg 


aaggaggatg 


agagtcttct 


ggatgaggag 


tttgagcttg 


3420 


gctgggtcca gggcccagca ctgactcccg 


tccctgagga 


ggaggaagaa 


gaggaagagg 


3480 


gggctccgat 


tgggacccct 


agggatcctg 


gagatggttg 


tccttccccc 


gacatccctc 


3540 


ctgaaccccc 


tccaacacac 


ctg;aggccct 


gccctgccag 


ccagctccct 


ggactcctgt 


3600 


cccatggcct 


cctggccggc 


ctctcctttg 


cagtggggtc 


ctcctctggc 


ctcctgcccc 


3660 


tcctgctgct 


gctgctgctt 


ccattgctgg 


cagcccaggg 


tgggggtggc 


ctgcaggcag 


3720 


cgctgctggc 


ccttgaggtg 


gggctggtgg 


gtctgggggc 


ctcctacctg ctcctttgta 


3780 


cagccctgca 


cctgccctcc 


agtcttttcc 


tactcctggc 


ccagggtacc 


gcactggggg 


3840 


ccgtcctggg 


cctgagctgg 


cgccgaggcc 


tcatgggtgt 


tcccctgggc 


cttggagctg 


3900 


cctggctctt 


agcttggcca ggcctagctc 


tacctctggt 


ggctatggca 


gcggggggca 


3960 


gatgggtgcg gcagcagggc 


ccccgggtgc 


gccggggcat 


atctcgactc 


tggttgcggg 


4020 


ttctgctgcg 


cctgtcaccc 


atggccttcc 


gggccctgca 


gggctgtggg 


gctgtggggg 


4080 


accggggtct 


gfcttgcactg 


taccccaaaa 


ccaacaagga 


tggcttccgc 


agccgcctgc 


4140 


ccgtccctgg 


gccccggcgg 


cgtaatcccc 


gcaccaccca 


acacccatta 


gctctgttgg 


4200 


caagggtctg 


ggtcctgtgc 


aagggctgga 


actggcgtct 


ggcacgggcc 


agccagggtt 


4260 


tagcatccca 


cttgcccccg 


tgggccatcc 


acacactggc 


cagctggggc 


ctgcttcggg 


4320 


„ gtgaacggcc 


cacccgaatc 


ccccggctac 


taccacgcag 


ccagcgccag 


ctagggcccc 


4380 


ctgcctccca 


ccagccactg ccagggactc 


tagccgggcg 


gaggtcacgc 


acccgccagt 


4440 


cccgggccct 


gcccccctgg 


aggtagctga 


ctccagccct 


tccagcccaa 


atctagagca 


4500 


ttgagcactt 


tatctcccac 


gactcagtga 


agtttctcca 


gtccctagtc 


ctctcttttc 


4560 


acccaccttc 


ctcagtttgc 


tcactfcaccc 


caggcccagc ccttcggacc 


tctagacagg 


4620 


cagcctcctc 


agctgtggag 


tccagcagtc 


actctgtgtt 


ctcctggcgc 


tcctccccta 


4680 


agttattgct 


gttcgcccgc 


tgtgtgtgct 


catcctcacc 


ctcattgact 


caggcctggg 


4740 


gccagggg : tg 


gtggagggtg 


ggaagagtca 


tgtttttttt 


ctcctctttg 


attttgtttt 


4800 
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4971 



60 
120 
180 
240 



tctgtctccc ttccaacctg tccccttccc cccaccaaaa aaagaaaaag acaaacacaa 4860 
ataaaatatc tgagcggaac tgtgaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 4920 
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa a 

<210> 5 
<211> 2958 
<212> DNA 
<213> Homo sapiens 

acggccatta ccaatcgcga aaccaaagga ctgaagttat aaaagagaaa agagaagttt 
gctgctaaaa tgaatctgag caatatggaa tattttgtgc cacacacaaa aaggtactga 
agatttaccc cccaaaaaaa attgtcaatg agaaataaag ctaactgata tcaaaaagca 
gagcctgctg tactggccat catgcgtaaa ggggtgctga aggacccaga gattgccgat 
ctattctaca aagatgatcc tgaggaactt tttattggtt tgcatgaaat tggacatgga 300 
agttttggag cagtttattt tgctacaaat gctcacacca gtgaggtggt ggcaattaag 
aagatgtcct atagtgggaa gcagacccat gagaaatggc aagatattct taaggaagtt 
aaatttttac gacaattgaa gcatcctaat actattgagt acaaaggctg ttacttgaaa 
gaacacactg cttggttggt gatggaatat tgcttaggct cagcctctga tttattagaa 
gttcataaaa aaccacttca ggaagtggag atcgctgcca ttactcatgg agccttgcat 
ggactagcct acctacattc tcatgcattg attcataggg atattaaagc aggaaatatt 
cttctaacag agccaggtca ggtaaaacta gctgattttg gatctgcttc aatggcttct 
cctgccaact ccttcgtggg cacaccttac tggatggctc cagaagtgat cttagctatg 
gatgaaggac agtatgatgg gaaagttgat atttggtcac ttggcatcac ttgtattgaa 
ttggcggaac ggaagccgcc ccttttcaac atgaatgcaa tgagtgcctt atatcacatt 
gcccagaatg actccccaac gttacagtct aatgaatgga cagactcctt taggagattt 
gttgattact gcttgcagaa aatacctcag gaaaggccaa catcagcaga actattaagg 
catgactttg ttcgacgaga ccggccacta cgtgtcctca ttgacctcat acagaggaca 
aaagatgcgg ttcgtgagct agataaccta cagtaccgaa aaatgaaaaa aatacttttc 
caagagacac ggaatggacc cttgaatgag tcacaggagg atgaggaaga cagtgaacat 
ggaaccagcc tgaacaggga aatggacagc ctgggcagca accattccat tccaagcatg 
tccgtaacat ggaaccagcc tgaacaggga aatggacagc ctgggcagca accattccat 
tccaagcatg tccgtgtcat gatgcacgat gacgaaagca caatcaattc cagctcctcc 
gtcgtgcata agaaagatca tgtattcata agggatgagg cgggccacgg cgatcccagg 1440 
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360 
420 
480 
540 
600 
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720 
780 
840 
900 
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1020 
1080 
1140 
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1260 
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cctgagccgc 


ggcctaccca 


gtcagttcag 


agccaggccc 


tccactaccg 


gaacagagag 


1500 


cgctttgcca 


cgatcaaatc 


agcatctttg 


gttacacgac 


agatccatga 


gcatgagcag 


1560 


gagaacgagt 


tgegggaaca 


gatgtcaggt 


tataagegga 


tgcggcgcca 


gcaccagaag 


1620 


cage t gat eg 


ccctggagaa 


caagctgaag 


gctgagatgg 


acgagcaccg 


cctcaagcta 


1680 


cagaaggagg 


tggagacgea 


tgccaacaac 


tcgtccatcg 


agctggagaa 


gctggccaag 


1740 


aagcaagtgg 


ctatcataga 


aaaggaggca 


aaggtagctg 


cagcagatga 


gaagaagttc 


1800 


cagcaacaga 


tcttggccca 


gcagaagaaa 


gatttgacaa 


ctttcttaga 


aagtcagaag 


1860 


aagcagtata 


agatttgtaa 


ggaaaaaata 


aaagaggaaa 


tgaatgagga 


ccatagcaca 


1920 


cccaagaaag 


agaagcaaga 


gcggatctcc 


aaacataaag 


agaacttgea 


gcacacacag 


1980 


gctgaagagg 


aagcccacct 


tctcactcaa 


cagagactgt 


actacgacaa 


aaattgtcgt 


2040 


ttcttcaagc 


ggaaaataat 


gatcaagegg 


cacgaggtgg 


agcagcagaa 


cattegggag 


2100 


gaactaaata 


aaaagaggac 


ccagaaggag 


atggagcatg 


ecatgetaat 


ccggcacgac 


2160 


gagtccaccc 


gagagctaga 


gtacaggcag 


ctgcacacgt 


tacagaagct 


aegcatggat 


2220 


ctgatccgtt 


tacagcacca 


gaeggaactg 


gaaaaccagc 


tggagtacaa 


taagaggega 


2280 


gaaagagaac 


tgcacagaaa 


gcatgtcatg 


gaacttegge 


aacagecaaa 


aaacttaaag 


2340 


gc c at ggaaa 


tgcaaattaa aaaacagttt 


caggacactt 


gcaaagtaca 


gaccaaacag 


2400 


tataaagcac 


tcaagaatca 


ccagttggaa 


gttactccaa 


agaatgagca 


caaaacaatc 


2460 


ttaaagacac 


tgaaagatga 


gcagacaaga 


aaacttgeca 


ttttggcaga 


gcagtatgaa 


2520 


cagagtataa 


atgaaatgat 


ggcctctcaa 


gcgttacggc 


tagatgaggc tcaagaagca 


2580 


gaatgecagg 


ccttgaggct 


acagctccag 


eaggaaatgg 


agetgetcaa 


cgcctaccag 


2640 


agcaaaatca 


agatgeaaac 


agaggcacaa 


catgaaegtg 


agctccagaa 


gctagagcag 


2700 


agagtgtctc 


tgegcagage 


acaccttgag 


cagaagattg 


aagaggagct 


ggctgccctt 


2760 


cagaaggaac 


geagegagag 


aataaagaac 


ctattggaaa 


ggcaagagcg 


agagattgaa 


O n i r\ 

z 820 


acttttgaca 


tggagagect 


cagaatggga 


tttgggaatt 


tggttacatt agattttcct 


2880 


aaggaggact 


acagatgaga 


ttaaattttt 


ttccatttac 


aaaaaaaaaa 


aaaaaaaaaa 


2940 


aaaaaaaaaa 


aaaaaaaa 










2958 



<210> 6 

<211> 2897 

<212> DNA 

<213> Homo sapiens 

<400> 6 

caaaggactg aagttataaa agagaaaaga gaagtttgct gctaaaatga atctgagcaa 
tatggaatat tttgtgccac acacaaaaag gtactgaaga tttacecccc aaaaaaaatt 
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gtcaatgaga aataaagcta actgatatca aaaagcagag cctgctctac tggccatcat 
gcgtaaaggg gtgctgaagg acccagagat tgccgatcta ttctacaaag atgatcctga 
ggaacttttt attggtttgc atgaaattgg acatggaagt tttggagcag tttattttgc, 
tacaaatgct cacaccagtg aggtggtggc aattaagaag atgtcctata gtgggaagca 
gacccatgag aaatggcaag atattcttaa ggaagttaaa tttttacgac aattgaagca 
tcctaatact attgagtaca aaggctgtta cttgaaagaa cacactgctt ggttggtgat 
ggaatattgc ttaggctcag cctctgattt attagaagtt cataaaaaac cacttcagga 
agtggagatc gctgccatta ctcatggagc cttgcatgga ctagcctacc tacattctca 
tgcattgatt catagggata ttaaagcagg aaatattctt ctaacagagc caggtcaggt 
aaaactagct gattttggat ctgcttcaat ggcttctcct gccaactcct tcgtgggcac 
accttactgg atggctccag aggtgatctt agctatggat gaaggacagt atgatgggaa 
agttgatatt tggtcacttg gcatcacttg tattgaattg gcggaacgga agccgcccct 
tttcaacatg aatgcaatga gtgccttata tcacattgcc cagaatgact ccccaacgtt 
acagtctaat gaatggacag actcctttag gagatttgtt gattactgct tgcagaaaat 
acctcaggaa aggccaacat cagcagaact attaaggcat gactttgttc gacgagaccg 
gccactacgt gtcctcattg acctcataca gaggacaaaa gatgcagttc gtgagctaga 
taacctacag taccgaaaaa tgaaaaaaat acttttccaa gagacacgga atggaccctt 
gaatgagtca caggaggatg aggaagacag tgaacatgga accagcctga acagggaaat 
ggacagcctg ggcagcaacc attccattcc aagcatgtcc gtgagcacag gcagccagag 
cagcagtgtg aacagcatgc aggaagtcat ggacgagagc agttccgaac ttgtcatgat 
gcacgatgac gaaagcacaa tcaattccag ctcctccgtc gtgcataaga aagatcatgt 
attcataagg gatgaggcgg gccacggcga tcccaggcct gagccgcggc ctacccagtc 
agttcagagc caggccctcc actaccggaa cagagagcgc tttgccacga tcaaatcagc 
atctttggtt acacgacaga tccatgagca tgagcaggag aacgagttgc gggaacagat 1560 
gtcaggttat aagcggatgc ggcgccagca ccagaagcag ctgatcgccc tggagaacaa 1620 
gctgaaggct gagatggacg agcaccgcct caagctacag aaggaggtgg agacgcatgc 1680 
caacaactcg tccatcgagc tggagaagct ggccaagaag caagtggcta tcatagaaaa 
ggaggcaaag gtagctgcag cagatgagaa gaagttccag caacagatct tggcccagca 
gaagaaagat ttgacaactt tcttagaaag tcagaagaag cagtataaga tttgtaagga 
aaaaataaaa gaggaaatga atgaggacca tagcacaccc aagaaagaga agcaagagcg 
gatctccaaa cataaagaga acttgcagca cacacaggct gaagaggaag cccaccttct 
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cactcaacag 


agactgtact 


acgacaaaaa 


ttgtcgtttc 


ttcaagcgga 


aaataatgat 


2040 


caagcggcac 


gaggtggagc 


agcagaacat 


tcgggaggaa 


ctaaataaaa 


agaggaccca 


2100 


gaaggagatg 


gagcatgcca 


tgctaatccg 


gcacgacgag 


tccacccgag 


agctagagta 


2160 


caggcagctg 


cacacgttac 


agaagctacg 


catggatctg 


atccgtttac 


agcaccagac 


2220 


ggaactggaa 


a'accagctgg 


agtacaataa 


gaggcgagaa 


agagaactgc 


acagaaagca 


2280 


tgtcatggaa 


cttcggcaac 


agccaaaaaa 


cttaaaggcc 


atggaaatgc 


aaattaaaaa 


2340 


acagtttcag 


gacacttgca 


aagtacagac 


caaacagtat 


aaagcactca 


agaatcacca 


2400 


gttggaagtt 


actccaaaga 


atgagcacaa 


aacaatctta 


aagacactga 


aagatgagca 


2460 


gacaagaaaa 


cttgccattt 


tggcagagca 


gtatgaacag 


agtataaatg 


aaatgatggc 


2520 


ctctcaagcg 


tfeacggctag 


atgaggctca 


agaagcagaa 


tgccaggcct 


tgaggctaca 


2580 


gctccagcag 


gaaatggagc 


tgctcaacgc 


ctaccagagc 


aaaatcaaga 


tgcaaacaga 


2640 


ggcacaacat 


gaacgtgagc 


tccagaagct 


agagcagaga 


gtgtctctgc 


gcagagcaca 


2700 


ccttgagcag 


aagattgaag 


aggagctggc 


tgcccttcag 


aaggaacgca 


gcgagagaat 


2760 


aaagaaccta 


ttggaaaggc 


aagagcgaga 


gattgaaact 


tttgacatgg 


agagcctcag 


2820 


aatgggattt 


gggaatttgg 


ttacattaga 


tfcttcctaag 


gaggactaca 


gatgagatta 


2880 


aattttttgc 


catttac 










2897 


<210> 7 

<211> 3148 

<212> DNA 

<213> Homo sapiens 












<400> 7 
ggcacgaggg 


tggcgccggg cggcggggtc 


ctgcgtggag 


agtgggacgc 


aacgccgaga 


60 


ccgcgagcag aggctgcgca 


cagccggatc 


eggcactcag 


cgaccggacc 


caaggatccg 


120 


ccggggaaca agccacagga gagcgactca ggaacaagtg 


tgggagagga 


agcggcggcg 


180 


gcggcgccgg gcccgggggt 


ggtgacagca ggtctgaggt 


tgcatcataa 


atacaaagga 


240 


ctgaagttat 


aaaagagaaa 


agagaagttt 


gctgctaaaa 


tgaatctgag 


caatatggaa 


300 


tattttgtgc 


cacacaeaaa 


aaggtactga 


agatttaccc 


cccaaaaaaa 


attgtcaatg 


360 


agaaataaag 


ctaactgata 


tcaaaaagca 


gagcctgctc 


tactggccat 


catgcgtaaa 


420 


ggggtgctga 


aggacccaga 


gattgccgat 


ctattctaca 


aagatgatcc 


tgaggaactt 


480 


tttattggtt 


tgcatgaaat 


tggacatgga 


agttttggag 


cagtttattt 


tgctacaaat 


540 


gctcacacca 


gtgaggtggt 


ggcaattaag 


aagatgtcct 


atagtgggaa 


gcagacccat 


600 


gagaaatggc 


aagatattct 


taaggaagtt 


aaatttttac 


gacaattgaa 


gcatcctaat 


660 
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actattgagt acaaaggctg ttacttgaaa gaacacactg 
tgcttaggct cagcctctga tttattagaa gttcataaaa 
atcgctgcca ttactcatgg agccttgcat ggactagcct 
attcataggg atattaaagc aggaaatatt cttctaacag 
gctgattttg gatctgcttc aatggcttct cctgccaact 
tggatggctc cagaggtgat cttagctatg gatgaaggac 
atttggtcac ttggcatcac ttgtattgaa ttggcggaac 
atgaatgcaa tgagtgcctt atatcacatt gcccagaatg 
aatgaatgga cagactcctt taggagattt gttgattact 
gaaaggccaa catcagcaga actattaagg catgactttg 
cgtgtcctca ttgacctcat acagaggaca aaagatgcag 
cagtaccgaa aaatgaaaaa aatacttttc caagagacac 
tcacaggagg atgaggaaga cagtgaacat ggaaccagcc 
ctgggcagca accattccat tccaagcatg tccgtgagca 
gtgaacagca tgcaggaagt catggacgag agcagttccg 
gacgaaagca caatcaattc cagctcctcc gtcgtgcata 
agggatgagg cgggccacgg cgatcccagg cctgagccgc 
agccaggccc tccactaccg gaacagagag cgctttgcca 
gttacacgac agatccatga gcatgagcag gagaacgagt 
tataagcgga tgcggcgcca gcaccagaag cagctgatcg 
gctgagatgg acgagcaccg cctcaagcta cagaaggagg 
tcgtccatcg agctggagaa gctggccaag aagcaagtgg 
aaggtagctg cagcagatga gaagaagttc cagcaacaga 
gatttgacaa ctttcttaga aagtcagaag aagcagtata 
aaagaggaaa tgaatgagga ccatagcaca cccaagaaag 
aaacataaag agaacttgca gcacacacag gctgaagagg 
cagagactgt actacgacaa aaattgtcgt ttcttcaagc 
cacgaggtgg agcagcagaa cattcgggag gaactaaata 
atggagcatg ccatgctaat ccggcacgac gagtccaccc 
ctgcacacgt tacagaagct acgcatggat ctgatccgtt 
gaaaaccagc tggagtacaa taagaggcga gaaagagaac 



cttggttggt gatggaatat 
aaccacttca ggaagtggag 
acctacattc tcatgcattg 
agccaggtca ggtaaaacta 
ccttcgtggg cacaccttac 
agtatgatgg gaaagttgat 
ggaagccgcc ccttttcaac 
actccccaac gttacagtct 
gcttgcagaa aatacctcag 
ttcgacgaga ccggccacta 
ttcgtgagct agataaccta 
ggaatggacc cttgaatgag 
tgaacaggga aatggacagc 
caggcagcca gagcagcagt 
aacttgtcat gatgcacgat 
agaaagatca tgtattcata 
ggcctaccca gtcagttcag 
cgatcaaatc agcatctttg 
tgcgggaaca gatgtcaggt 
ccctggagaa caagctgaag 
tggagacgca tgccaacaac 
ctatcataga aaaggaggca 
tcttggccca gcagaagaaa 
agatttgtaa ggaaaaaata 
agaagcaaga gcggatctcc 
aagcccacct tctcactcaa 
ggaaaataat gatcaagcgg 
aaaagaggac ccagaaggag 
gagagctaga gtacaggcag 
tacagcacca gacggaactg 
tgcacagaaa gcatgtcatg 



720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
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gaacttcggc 


aacagccaaa 


aaacttaaag 


gccatggaaa 


tgcaaattaa 


aaaacagttt 


2580 


c aggac ac 1 1 


gc aaagt ac a 


gaccaaacag 


tataaagcac 


tcaagaatca 


ccagttggaa 


2640 


gfctacfcccaa 


agaatgagca 


caaaacaatc 


ttaaagacac 


tgaaagatga 


gcagacaaga 


2700 


aaacttgcca 


ttttggcaga 


gcagtatgaa 


cagagtataa 


atgaaatgat 


ggcctctcaa 


2760 


crccrttacorcrc 


tagatgaggc 


tcaagaagca 


gaatgccagg 


ccttgaggct 


acagctccag 


2820 


caggaaafcgg 


agctgc tcaa 


cgcctaccag 


agcaaaatca 


agatgcaaac 


agaggcacaa 


2880 


catgaacgtg 


agctccagaa 


gctagagcag 


agagtgtctc 


tgcgcagagc 


acaccttgag 


2940 


cagaagattg 


aagaggagct 


ggctgccctt 


cagaaggaac 


gcagcgagag 


3 f* 3 a 3 a 

adLaaaydau 




ctattggaaa 


ggcaagagcg 


agagattgaa 


acttttgaca 


tggagagcct 


cagaatggga 


3060 


tttgggaatt 


tggttacatt 


agattttcct 


aaggaggact 


acagatgaga 


ttaaattttt 


3120 


tgccatttac 


aaaaaaaaaa 


aaaaaaaa 








3148 



<210> 8 

<211> 4188 

<212> DNA 

<213> Homo sapiens 

<400> 8 

gccggggaac aagccacagg agagcgactc aggaacaagt gtgggagagg aagcggcggc 60 

ggcggcgccg ggcccggggg tggtgacagc aggtctgagg ttgcatcata aatacaaagg 12 0 

actgaagtta taaaagagaa aagagaagtt tgctgctaaa atgaatctga gcaatatgga 180 

atattttgtg coacacacaa aaaggtactg aagatttacc ccccaaaaaa aattgtcaat 240 

gagaaataaa gctaactgat atcaaaaagc agagcctgct ctactggcca tcatgcgtaa 300 

aggggtgctg aaggacccag agattgccga tctattctac aaagatgatc ctgaggaact 3 60 

ttttattggt ttgcatgaaa ttggacatgg aagttttgga gcagtttatt ttgctacaaa 420 

tgctcacacc aatgaggtgg tggcaattaa gaagatgtcc tatagtggga agcagaccca 480 

tgagaaatgg caagatattc ttaaggaagt taaattttta cgacaattga agcatcctaa 540 

tactattgag tacaaaggct gttacttgaa agaacacact gcttggttgg tgatggaata 600 

ttgcttaggc tcagcctctg atttattaga agttcataaa aaaccacttc aggaagtgga 660 

gatcgctgcc attactcatg gagccttgca tggactagcc tacctacatt ctcatgcatt 720 

gattcatagg gatattaaag caggaaatat tcttctaaca gagccaggtc aggtaaaact 780 

agctgatttt ggatctgctt caatggcttc tcctgccaac. tccttcgtgg gcacacctta 840 

ctggatggct ccagaggtga tcttagctat ggatgaagga cagtatgatg ggaaagttga 900 

tatttggtca cttggcatca cttgtattga attggcggaa cggaagccgc cccttttcaa 960 

catgaatgca atgagtgcct tatatcacat tgcccagaat gactecccaa cgttacagtc 1020 
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taatgaatgg 
ggaaaggcca 
acgtgtcctc 
acagtaccga 
gtcacaggag 
cctgggcagc 
tgtgaacagc 
tgacgaaagc 
aagggatgag 
gagccaggcc 
ggttacacga 
ttataagcgg 
ggctgagatg 
ctcgtccatc 
aaaggtagct 
agatttgaca 
aaaagaggaa 
caaacataaa 
acagagactg 
gcacgaggtg 
gatggagcat 
gctgcacacg 
ggaaaaccag 
ggaacttcgg 
tcaggacact 
agttactcca 
aaaacttgcc 
agcgttacgg 
gcaggaaatg 
acatgaacgt 
gcagaagatt 



acagactcct 
acatcagcag 
attgacctca 
aaaatgaaaa 
gatgaggaag 
aaccattcca 
atgcaggaag 
acaatcaatt 
gcgggccacg 
ctccactacc 
cagatccatg 
atgcggcgcc 
gacgagcacc 
gagctggaga 
gcagcagatg 
actttcttag 
atgaatgagg 
gagaacttgc 
tactacgaca 
gagcagcaga 
gccatgctaa 
ttacagaagc 
ctggagtaca 
caacagccaa 
tgcaaagtac 
aagaatgagc 
attttggcag 
ctagatgagg 
gage tgc tea 
gagctccaga 
gaagaggagc 



ttaggagatt tgttgattac tgettgeaga 
aactattaag gcatgacttt gttcgacgag 
tacagaggac aaaagatgea gttcgtgagc 
aaatactttt ccaagagaca eggaatggae 
acagfcgaaca tggaaccagc ctgaacaggg 
ttccaagcat gtccgtgagc acaggcagcc 
tcatggacga gagcagttcc gaacttgtca 
ccagctcctc cgtcgtgcat aagaaagatc 
gcgatcccag gcctgagccg cggcctaccc 
ggaacagaga gcgctttgcc acgatcaaat 
agcatgagca ggagaacgag ttgegggaac 
agcaccagaa gcagctgatc gecctggaga 
gcctcaagct acagaaggag gtggagacgc 
agctggccaa gaagcaagtg gctatcatag 
agaagaagtt ccagcaacag atcttggccc 
aaagtcagaa gaagcagtat aagatttgta 
accatagcac acccaagaaa gagaagcaag 
agcacacaca ggctgaagag gaagcccacc 
aaaattgtcg tttcttcaag eggaaaataa 
acatteggga ggaactaaat aaaaagagga 
tccggcacga cgagtccacc cgagagctag 
tacgcatgga tetgatcegt ttacagcacc 
ataagaggcg agaaagagaa ctgcacagaa 
aaaacttaaa ggccatggaa atgeaaatta 
agaccaaaca gtataaagca ctcaagaatc 
acaaaacaat cttaaagaca ctgaaagatg 
agcagtatga acagagtata aatgaaatga 
ctcaagaagc agaatgecag gecttgagge 
acgcctacca gagcaaaatc aagatgcaaa 
agctagagca gagagtgtct ctgegcagag 
tggctgccct tcagaaggaa egcagegaga 
16 



aaatacctca 
accggccact 
tagataacct 
ccttgaatga 
aaatggacag 
agagcagcag 
tgatgeaega 
atgtattcat 
agtcagttca 
cagcatcttt 
agatgtcagg 
acaagctgaa 
atgccaacaa 
aaaaggaggc 
agcagaagaa 
aggaaaaaat 
ageggatetc 
ttctcactca 
tgatcaagcg 
cccagaagga 
agtacaggca 
agaeggaact 
agcatgtcat 
aaaaacagtt 
accagttgga 
agcagacaag 
tggcctctca 
tacagctcca 
cagaggcaca 
cacaccttga 
gaataaagaa 



1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040- 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2 640 

2700 

2760 

2820 

2880 



owcrwin- vwo 
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cctattggaa 
atttgggaat 
ttgccattta 
cacattcccc 
tcgtgtcgga 
grtcaactttc 
tattgatgtg; 
aaaccaacaa 
ctgtgaaagt 
ccgcagcctt 
tttattcatc 
tacagaaact 
ggggtgggtg 

gcatatctga 
acagatccca 
aaggagaggt 
aacttcttct 
acactgtctc 
tcccatttct 
aataaaaaat 
acaaaacatt 
ttgtaaataa 



aggcaagagc 

ttggttacat 

caaaaaaaaa 

attttaacgg 

ctagtgcctg 

agtgctggcc 

attcaaaatg 

caaaaaaagt 

aaagcaaaaa 

acaccttaac 

aggaataaaa 

aattgtgatg 

gggtgaagag 

tgtatcataa 

gtagcacaat 

attacttatt 

tcgtgttctt 

tcaatcagat 

tggtcataaa 

gatgaaaaaa 

tttcaggfctc 

aagtgttcat 



gagagattga 

tagattttcc 

aaaaaaaaga 

gcgttgctct 

tttattctta 

aaaacctggc 

tttcagtgaa 

ggatgtatat 

ataataataa 

tagctgctgc 

aaaacaaagt 

tacatatcag 

atcttgtgat 

tcatttctgc 

tgggctttgg 

ttcatattgt 

ccccacctcc 

catctggatc 

tattgcatta 

aaaaaggctt 

caaggaagag 

cacaaaaaaa 



aacttttgac 

taaggaggac 

aaacagaaaa 

cactctctct 

ctccatcagg 

cgtctcttct 

aactttggag 

tgctttaagc 

taaatgccaa 

attattttat 

tttattaaag 

tggtgacata 

tttagactgc 

tgtgcagagg 

caaatgggta 

attttaactg 

ttccttttcc 

tttttcttta 

ttcacacttt 

tacttctttt 

catgataact 

aaaaaaaaaa 



atggagagcc 
tacagatgag 
aaattcagac 
ctctcttact 
ggcccccttc 
attcacagta 
acagttttaa 
aatcactcat 

gggggagaga 

tttattttat 
attgaaaatt 
ttattacttt 
tgcagagtta 
agggatacac 
ttttgtgtat 
tttctcggat 
agttcagtat 
tctcccttcc 
caaactgtgt 
gcatgcactt 
gtcagagctt 
aaaaaaaa 



tcagaatggg 

attaaatttt 

cctgcaaaac 

cttactgaca 

ctccccccgt 

cacgtcacag 

caaaaccaat 

taccaccaat 

gacacaatat 

ttttttggta 

tgatacattt 

tttggggacg 

acttgtctea 

ttaggggctc 

agaggaattt 

caaatttttt 

ttggagttca 

ccttcctaag 

attttcttac 

taaaaacaaa 

ttaattatat 



<210> 9 

<211> 2807 

<212> DNA 

<213> Homo sapiens 

<400> 9 

gtactgaaga tttacccccc aaaaaaaatt 
aaaagcagag cctgctctac tggccatcat 
tgccgatcta ttctacaaag atgatcctga 
acatggaagt tttggagcag tttattttgc 
aattaagaag atgtcctata gtgggaagca 



2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4188 



gtcaatgaga aataaagcta actgatatca 60 

gcgtaaaggg gtgctgaagg acccagagat 120^ 

ggaacttttt attggtttgc atgaaattgg 180 

tacaaatgct cacaccagtg aggtggtggc 240 

gacccatgag aaatggcaag atattcttaa 300 
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ggaagttaaa 
cttgaaagaa 
attagaagtt 
cttgcatgga 
aaatattctt 
ggcttctcct 
agctatggat 
tattgaattg 
tcacattgcc 
gagatttgtt 
attaaggcat 
gaggacaaaa 
acttttccaa 
tgaacatgga 
aagcatgtcc 
ggacgagagc 
ctcctccgtc 
tcccaggcct 
cagagagcgc 
tgagcaggag 
ccagaagcag 
caagctacag 
ggccaagaag 
gaagttccag 
tcagaagaag 
tagcacaccc 
cacacaggct 
ttgtcgtttc 
tcgggaggaa 
gcacgacgag 
catggatctg 



tttttacgac aattgaagca tcctaatact 
cacactgctt ggttggtgat ggaatattgc 
cataaaaaac cacttcagga agtggagatc 
ctagcctacc tacattctca tgcattgatt 
ctaacagagc caggtcaggt aaaactagct 
gccaactcct tcgtgggcac accttactgg 
gaaggacagt atgatgggaa agttgatatt 
gcggaacgga agccgcccct tttcaacatg 
cagaatgact ccccaacgtt acagtctaat 
gattactgct tgcagaaaat acctcaggaa 
gactttgttc gacgagaccg gccactacgt 
gatgcagttc gtgagctaga taacctacag 
gagacacgga atggaccctt gaatgagtca 
accagcctga acagggaaat ggacagcctg 
gtgagcacag gcagccagag cagcagtgtg 
agttccgaac ttgtcatgat gcacgatgac 
gtgcataaga aagatcatgt attcataagg 
gagccgcggc ctacccagtc agttcagagc 
tttgccacga tcaaatcagc atctttggtt 
aacgagttgc gggaacagat gtcaggttat 
ctgatcgccc tggagaacaa gctgaaggct 
aaggaggtgg agacgcatgc caacaactcg 
caagtggcta tcatagaaaa ggaggcaaag 
caacagatct tggcccagca gaagaaagat 
cagtataaga tttgtaagga aaaaataaaa 
aagaaagaga agcaagagcg gatctccaaa 
gaagaggaag cccaccttct cactcaacag 
ttcaagcgga aaataatgat caagcggcac 
ctaaataaaa agaggaccca gaaggagatg 
tccacccgag agctagagta caggcagctg 
atccgtttac agcaccagac ggaactggaa 



attgagtaca 
ttaggctcag 
gctgccatta 
catagggata 
gattttggat 
atggctccag 
tggtcacttg 
aatgcaatga 
gaatggacag 
aggccaacat 
gtcctcattg 
taccgaaaaa 
caggaggatg 
ggcagcaacc 
aacagcatgc 
gaaagcacaa 
gatgaggcgg 
caggccctcc 
acacgacaga 
aagcggatgc 
gagatggacg 
tccatcgagc 
gtagctgcag 
ttgacaactt 
gaggaaatga 
cataaagaga 
agactgtact 
gaggtggagc 
gagcatgcca 
cacacgttac 
aaccagctgg 



aaggctgtta 3 60 

cctctgattt 420 

ctcatggagc 480 

ttaaagcagg 540 

ctgcttcaat 600 

aggtgatctt 660 

gcatcacttg 720 

gtgccttata 780 

actcctttag 840 

cagcagaact 900 

acctcataca 960 

tgaaaaaaat 1020 

aggaagacag 1080 

attccattcc 1140 

aggaagtcat 1200 

tcaattccag 12 60 

gccacggcga 1320 

actaccggaa 1380 

tccatgagca 144C 

ggcgccagca 1500 

agcaccgcct 1560 

tggagaagct 1620 

cagatgagaa 1680 

tcttagaaag 1740 

atgaggacca 1800 

acttgcagca 1860 

acgacaaaaa 1920 

agcagaacat 1980 

tgctaatccg 2040 

agaagctacg 2100 

agtacaataa 2160 
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gaggcgagaa 


agagaactgc 


acagaaagca 


tgtcatggaa 


cttcggcaac 


agccaaaaaa 


2220 


cttaaaggcc 


atggaaatgc 


aaattaaaaa 


acagtttcag 


gacacttgca 


aagtacagac 


2280 


caaacagtat 


aaagcactca 


agaatcacca 


gttggaagtt 


actccaaaga 


atgagcacaa 


2340 


aacaatctta 


aagacactga 


aagatgagca 


gacaagaaaa 


cttgccattt 


tggcagagca 


'2400 


gtatgaacag 


agtataaatg 


aaatgatggc 


ctctcaagcg 


ttacggctag atgaggctca 


2460 


agaagcagaa 


tgccaggcct 


tgaggctaca 


gctccagcag 


gaaatggagc 


tgctcaacgc 


2520 


ctaccagagc 


aaaatcaaga 


tgcaaacaga 


ggcacaacat 


gaacgtgagc 


tccagaagct 


2580 


agagcagaga 


gtgtc fcc tgc 


*~T ^ **T2a rtr* s& A 
yCdy ayCaLa 


i_.i_.t- i— y cxy >— y 


aagattgaag 


aggagctggc 


2640 


tgcccttcag 


aaggaacgca 


gcgagagaat 


aaagaaccta 


ttggaaaggc 


aagagcgaga 


2700 


gattgaaact 


tttgacatgg 


agagcctcag 


aatgggattt 


gggaatttgg 


ttacattaga 


2760 


ttttcctaag 


gaggactaca 


gatgagatta 


aattttttgc 


catttac 




2807 



<210> 10 

<211> 4620 

<212> DNA 

<213> Homo sapiens 

<400> 10 

gagccgtgat tgtgccacta cactccagcc ctgacctctt acaecgaagc agtcctcata 60 
cgtcagcctc ccaaagtgct gggattacag atgaaccaag gafecgggata gcagtataaa 120 



attagaatca. agacagctga ctgctcagca ggatgccatc aactaacaga gcaggcagcc 



atctcagaga aattggccat ggaagctttg gagcagtgta ttttgcacga gatgtgcgta 



180 



tgaaggaccc tgaaattgca gagctcttct tcaaagaaga tccagagaag ctcttcacag 240 



300 



ccaatgaagt ggtggccatc aagaaaatgt cttatagtgg aaagcagtct actgagaaat 3 60 

ggcaggatat tattaaggaa gtcaagtttc feacaaagaat aaaacatccc aacagtatag 420 

aatacaaagg ctgttattta cgtgaacaca cagcatggct tgtaatggaa tattgtttag 480 

gatctgcttc ggatttacta gaagttcaca aaaagqcatt acaagaagtg gaaatagcag 540 

caattacaca tggtgctctt cagggattag cctacttaca. ttctcatact atgattcata 600 

gagatatcaa agcaggaaat atccttctga cagaaccagg ccaggtgaaa cttgctgact 660 

ttggctctgc ttccatggca tcacctgcca attcctttgt gggaacgccg tattggatgg 720 

ccccagaagt aattttagcc atggatgaag gacaatatga tggcaaagta gatgtgtggt 780 

ctcttggaat aacatgtatt gaactagcgg aaaggaagcc , . t cc 1 1 tat 1 1 aatatgaatg 840 

caatgagtgc cttatatcac atagcccaaa atgaatcccc tacactacag tctaatgaat 900 

ggtctgatta ttttcgcaac tttgtagatt cttgcctcca gaaaatccct caagatcgac 960 

ctacatcaga ggaactttta aagcacatat ttgttcttcg ggagcgccct gaaaccgtgt 1020 
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taatagatct cattcagagg acaaaggatg cagtaagaga gctggacaat ctgcagtatc 
gaaagatgaa gaaactcctt ttccaggagg cacataatgg accagcagta gaagcacagg 
aagaagaaga ggaacaagat catggtgttg gccggacagg aacagttaat agtgttggaa . 
gtaatcaatc cattcccagc atgtccatca gtgccagcag ccaaagcagt agtgttaaca 
gtcttccaga tgtctcagat gacaagagtg agctagacat gatggaggga gaccacacag 
tgatgtctaa cagttctgtt atccatttaa aaccagagga agaaaattac agagaagagg 
gagatcctag aacaagagca tcagatccac aatctccacc ccaagtatct cgtcacaaat 
cacactatcg taatcgagaa cactttgcta ctatacggac agcatcactg gttacgaggc 
aaatgcaaga acatgagcag gactctgagc ttagagaaca aatgtctggc tataagcgaa 
tgaggcgaca acatcaaaag caactgatga ctctggaaaa caagctaaag gctgagatgg 
atgaacatcg cctcagatta gacaaagatc ttgaaactca gcgtaacaat tttgctgcag 
aaatggagaa acttatcaag aaacaccagg ctgctatgga gaaagaggct aaagtgatgt 
ccaatgaaga gaaaaaattt cagcaacata ttcaggccca acagaagaaa gaactgaata 
gttttctcga gtcccagaaa agagagtata aacttcgaaa agagcagctt aaagaggagc 
taaatgaaaa ccagagtacc cccaaaaaag aaaaacagga gtggctttca aagcagaagg 
agaatataca gcatttccaa gcagaagaag aagctaacct tcttcgacgt caaagacaat 
acctagagct ggaatgccgt cgcttcaaga gaagaatgtt acttgggcgt cataacttag 
agcaggacct tgtcagggag gagttaaaca aaagacagac tcagaaggac ttagagcatg 
ccatgctact ccgacagcat gaatctatgc aagaactgga gttccgccac ctcaacacaa 
ttcagaagat gcgctgtgag ttgatcagat tacagcatca aactgagctc actaaccagc 
tggaatataa taagcgaaga gaacgagaac taagacgaaa gcatgtcatg gaagttcgac 
aacagccfcaa gagtttgaag tctaaagaac tccaaataaa aaagcagttt caggatacct 
gcaaaatcca aaccagacag tacaaagcat taagaaatca cctgctggag actacaccaa 
agagtgagca caaagctgtt ctgaaacggc tcaaggagga acagacccgg aaattagcta 
tcttggctga gcagtatgat cacagcatta atgaaatgct ctccacacaa gccctgcgtt 
tggatgaagc acaggaagca gagtgccagg ttttgaagat gcagctgcag caggaactgg 
agctgttgaa tgcgtatcag agcaaaatca agatgcaagc tgaggcacaa catgatcgag 
agcttcgega gcttgaacag agggtctccc tccggagggc actcttagaa caaaagattg 
aagaagagat gttggctttg cagaatgagc gcacagaacg aatacgaagc ctgttggaac 
gtcaagccag agagattgaa gcttttgact ctgaaagcat gagactaggt tttagtaata 
tggtcctttc taatctctcc cctgaggcat tcagccacag ctacccggga gcttctggtt 
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1080 
1140 
1200 
1260 
1320 
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1560 
1620 
1680 
1740 
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1920 
.1980 
,.2040. 
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ggtcacacaa 


ccctactggg 


ggtccaggac 


ctcactgggg 


tcatcccatg 


ggtggcccac 


2940 


cacaagcttg 


gggccatcca 


atgcaaggtg 


gaccccagcc 


atggggtcac 


ccttcagggc 


3000 


caatgcaagg 


ggtacctcga 


ggtagcagta 


tgggagtccg 


caatagcccc 


caggctctga 


3060 


ggcggacagc 


ttctggggga 


cggacagagc 


agggcatgag 


cagaagcacg 


agtgtcactt 


3120 


t cacaaatatc 


caatgggtca 


cacatgtctt 


atacataact 


taataattga 


gagtggcaat 


3180 


tccgctggag 


ctgtctgcca 


aaagaaactg 


cctacagaca 


tcatcacagc 


agcctcctca 


3240 


... cttgggtact 


acagtgtgga 


agctgagtgc 


atatggtata 


ttttattcat 


ttttgtaaag 


3300 


cgttctgttt 


tgtgtttact 


aattgggatg 


tcatagtact 


tggctgccgg 


gtttgtttgt 


3360 


ttttggggaa 


attttgaaaa 


gtggagttga 


tattaaaaat 


aaatgtgtat 


gtgtgtacat 


3420 


atatatacac 


acacatacac 


atatattatg 


catgtggtga 


aaagaattgg 


ctagataggg 


3480 


gatttttctg 


aacactgcaa 


aaatagaacg 


tagcaaaatg 


gcttcagtta 


tcacttttgg 


3540 


gtgtctgtat 


cctaagaagt 


ttctgaaaag 


atctaaagcc 


tttttatccc 


atatcccaaa 


3600 


ttcttatgag 


ccactcacag 


caggcagcat 


atgttgaaat 


aagttattac 


tggtacacac 


3660 


ctgcattgcc 


tcaccagtgt 


atttatttgt 


tattaaattg 


atctgacttc 


tcagcctcat 


3720 


ttggactaaa 


aaaagaaagc 


agaaatccat 


gaacacattg 


cttctcggcc 


ttttggctaa 


3780 


gatcaagtgt 


agaaatccat 


gaacactaaa 


ggacttcatt 


gattttttca 


gagagtagaa 


3840 


aacaacttag 


tttttctttt 


ttcctgaatg 


cgtcataggc 


ttgtgagtga 


tttttgtcca 


3900 


ttcaattgtg 


ccttctttgt 


attatgataa 


gatgggggta 


cttaaggaga 


tcaeaagttg 


3960 


tgtgaggatt 


gcattaacaa 


acctatgagc 


cttcaatggg 


gaagaccaga 


agggtgagag 


4020 


gggccctgaa 


agttcatatg 


gtgggtatgt 


cccgcagcag 


agtgaggaga 


tgaagcttac 


4080 


gtgtcctgac 


gttttgttgc 


ttatactgtg 


atatctcatc 


ctagctaagc 


tctataatgc 


4140 


... ccgagacccc 


aaacagtact 


tttactttgt. 


ttgtacaaaa 


acaaagacat 


atagccaata 


4200 


caaatcaaat 


gccggaggtg 


tttgatgcca 


tatttgcaaa 


ttgccatcta 


ttgaaattct 


4260 


,cgtcacacta 


catagacata 


attgttatct ccttttggct 


tatgtgattt 


tctgtttaca 


4320 


agtagaatag 


ccaattattt 


aaatgtttag 


ttgccacagt 


gaaccaggag 


tcactgagcc 


4380 


aatgacttta 


ccagctgctg 


actaatcttc 


atcaccactg 


tagattttgc 


tgcatgtgca 


4440 


ggtcctctat 


ttttaattgc 


tgttttcgtt 


gctgcagtac 


tttacaaact 


tctagttcgt 


4500 


tgagacttag 


tgaccatttg 


gcatcaagtt 


aacatcacac 


aataggaaac 


accacttcca 


4560 


caagtctcaa 


gcctcagtgc 


taaagtacta 


ctgaaaagga 


actaggaagt 


ttggccaatt 


4620 



<210> 11 
<211> 4536 

21 



BNSDOCID: <WO 03051905A2J_> 



WO 03/051905 



PCT/US02/39742 



<212> DNA 

<213> Homo sapiens 

<220> 

<2 2 1 > mi s cofeature 

<222> (1)..(4536) 

<223> "n" is A, C, G, or T 



tacagatgaa ccaaggatcg ggatagcagt ataaaattag aatcaagaca gctgactgct 
cagcaggatg ccatcaacta acagagcagg cagcctgaag gaccctgaaa ttgcagagct 
cttattcaaa gaagatccag agaagctctt cacagatctc agagaaattg gccatggaag 
ctttggagca gtgtattttg cacgagatgt gcgtaccaat gaagtggtgg ccatcaagaa 
aatgtcttat agtggaaagc agtctactga gaaatggcag gatattatta aggaagtcaa 
gtttctacaa agaataaaac atcccnaaca gtatagaata caaaggctgt tatttacgtg 
aacacacagc atggcttgta atggaatatt gtttaggatc tgcttcggat ttactagaag 
ttcacaaaaa gccattacaa gaagtggaaa tagcagcaat tacacatggt gctcttcagg 
gattagccta cttacattct catactatga ttcatagaga tatcaaagca ggaaatatcc 
ttctgacaga accaggccag gtgaaacttg ctgactttgg ctctgcttcc atggcatcac 
ctgccaattc ctttgtggga acgccgtatt ggatggcccc agaagtaatt ttagccatgg 
atgaaggaca atafcgatggc aaagtagatg tgtggtctct tggaataaca tgtattgaac 
tagcggaaag gaagcctcct ttatttaata tgaatgcaat gagtgcctta tatcacatag 
cccaaaatga atcccctaca ctacagtcta atgaatggtc tgattatttt cgcaactttg 
tagattcttg cctccagaaa atccctcaag atcgacctac atcagaggaa cttttaaagc 
acatatttgt tcttcgggag cgccctgaaa ccgtgttaat agatctcatt cagaggacaa 
aggatgcagt aagagagctg gacaatctgc agtatcgaaa gatgaagaaa ctccttttcc 
aggaggcaca taatggacca gcagtagaag cacaggaaga agaagaggaa qaagatcatg 
gtgttggccg gacaggaaca gttaatagtg ttggaagtaa tcaatccatt cccagcatgt 
ccatcagtgc cagcagccaa agcagtagtg ttaacagtct tccagatgtc tcagatgaca 
agagtgagct agacatgatg gagggagacc acacagtgat gtctaacagt tctgttatcc 
atttaaaacc agaggaagaa aattacagag aagagggaga tcctagaaca agagcatcag 
atccacaatc tccaccccaa gtatctcgtc acaaatcaca ctatcgtaat cgagaacact 
ttgctactat acggacagca tcactggtta cgaggcaaat gcaagaacat gagcaggact 
ctgagcttag agaacaaatg tctggctata agcgaatgag gcgacaacat caaaagcaac 
tgatgactct ggaaaacaag ctaaaggctg agatggatga acatcgcctc agattagaca 
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=> a cri=i t~ c t~ t~ era 


aactcagegt 


aacaattttg 


ctgcagaaat 


ggagaaactt 


atcaagaaac 


1620 


pa ctctc t~ ere 
c*.\_ v^. ciy y \_- i_» y \~ 


tatggagaaa gaggctaaag tgatgtccaa 


tgaagagaaa 


aaatttcagc 


1680 




ggcccaacag 


aagaaagaac 


tgaatagttt 


tctcgagtcc 


cagaaaagag 


1740 


pi rr\- za f- pi = /— ■ f- 
dy LCI L.CLGLGLW I— 


tcgaaaagag 


cagcttaaag 


aggagctaaa 


tgaaaaccag 


agtaccccca 


1800 




acaggagtgg 


ctttcaaagc 


agaaggagaa 


tat ac age a t 


ttccaagcag 


1860 


Pi pt a pi pi rrpi Pi err* 
ex. ay a.a.y cx ci y w 


taaccttctt 


egaegtcaaa 


gacaatacct 


agagc t ggaa 


tgccgtcgct 


1920 


' f" c a a era era a er 


aatgttactt 


gggegtcata 


acttagagca 


ggaccttgtc 


agggaggagt 


1980 


taaacaaaag 


acagactcag 


aaggacttag 


ageatgecat 


gctactccga 


cagcatgaat 


2040 


etatgeaaga 


actggagttc 


cgccacctca 


acacaattca 


gaagatgege 


tgtgagttga 


2100 


tcagattaca 


gcatcaaact 


gagctcacta 


aecagctgga 


atataataag 


cgaagagaac 


2160 


gagaactaag acgaaagcat 


gtcatggaag ttcgacaaca . 


gectaagagt 


ttgaagtcta 


2220 


aagaactcca 


aataaaaaag 


cagtttcagg 


atacctgeaa 


aatccaaacc 


agacagtaca 


2280 


aagcattaag 


aaatcacctg 


ctggagacta 


caccaaagag 


t gage ac aaa 


gctgttctga 


2340 


aacggctcaa 


ggaggaacag 


acceggaaat 


tagctatctt 


gerc t gage agr 


tatgatcaca 


2400 


gcattaatga 


aatgctctcc 


acacaagccc tgcgtttgga 


tgaagcacag 


gaagcagagt 


2460 


gccaggtttt 


gaagatgcag 


ctgeagcagg 


aactggagct 


gttgaatgcg 


t at c agagc a 


252 0 


aaatcaagat 


gcaagctgag 


gcacaacatg 


atcgagagct 


fcegegagett 


gaacagaggg 


2580 


tctccctccg 


gagggcactc 


ttagaacaaa 


agattgaaga 


a gaga t qrt t or 


gc 1 1 1 gcaga 


2640 


atgagegcac 


agaacgaata 


egaagectgt 


tggaaegtea 


age c agagag 


attgaagctt 


2700 


ttgactctga 


aagcatgaga 


ctaggtttta 


gtaatatggt 


cctttctaat 


ctctcccctg 


2760 


aggcattcag 


ccacagctac 


cegggagett 


ctggttggtc 


acacaaccct 


ac t ggggg t c 


2820 


caggacctca 


ctggggtcat 


cccatgggtg 


gcccaccaca 


agcttggggc 


catccaatgc 


2880 


aaggtggacc 


ccagccatgg 


ggtcaccctt 


cagggecaat 


gcaaggggta 


cctcgaggta 


2940 


gcagtatggg 


agtccgcaat 


agcccccagg 


etctgaggeg 


gacagc t.tct 


gggggacgga 


3000 


cagagcaggg 


catgagcaga agcacgagtg tcacttcaca 


aatatccaat 


gggtcacaca 


3060 


tgtcttatac 


ataacttaat 


aattgagagt 


ggcaattccg 


ctggagc tgt 


ctgccaaaag 


3120 


aaactgecta 


cagacatcat 


cacagcagcc 


tcctcacttg 


ggtactacag 


t art ggaagc t 


3180 


gagtgeatat 


ggtatatttt 


attcattttt 


gtaaagcgtt 


ctgttttgtg 


tttactaatt 


3240 


gggatgtcat 


agtacttggc 


tgccgggttt 


gtttgttttt 


ggggaaattt 


tgaaaagtgg 


3300 


agttgatatt 


aaaaataaat 


gtgtatgtgt 


gtacatatat 


atacacacac 


atacacatat 


3360 


attatgeatg 


tggtgaaaag 


aattggctag 


ataggggatt 


tttctgaaca 


ctgcaaaaat 


3420 
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agaacgtagc 
gaaaagatct 
cagcatatgt 
atttgttatt 
atccatgaac 
actaaaggac 
tgaatgcgtc 
tgataagatg 
atgagccttc 
gtatgtcccg 
actgtgatat 
ctttgtttgt 
atgccatatt 
ttatctcctt 
gtttagttgc 
atcttcatca 
ttcgttgctg 
caagttaaca 
gtactactga 



aaaatggctt 
aaagcctttt 
tgaaataagt 
aaattgatct 
acattgcttc 
ttcattgatt 
ataggcttgt 

ggggtactta 
aatggggaag 
cagcagagtg 
ctcatcctag 
acaaaaacaa 
tgcaaattgc 
ttggcttatg 
cacagtgaac 
ccactgtaga 
cagtacttta 
tcacacaata 
aaaggaacta 



cagttatcac 
tatcccatat 
tattactggt 
gacttctcag 
tcggcctttt 
ttttcagaga 
gagtgatttt 
aggagatcac 
accagaaggg 
aggagatgaa 
ctaagctcta 
agacatatag 
catctattga 
tgattttctg 
caggagtcac 
ttttgctgca 
caaacttcta 
ggaaacacca 
ggaagtttgg 



ttttgggtgt 
cccaaattct 
acacacctgc 
cctcatttgg 
ggctaagatc 
gtagaaaaca 
tgtccattca 
aagttgtgtg 
tgagaggggc 
gcttacgtgt 
taatgcccaa 
ccaatacaaa 
aattctcgtc 
tttacaagta 
tgagccaatg 
tgtgcaggtc 
gttcgttgag 
cttccacaag 
ccaatt 



ctgtatccta 
tatgagccac 
attgcctcac 
actaaaaaaa 
aagtgtagaa 
acttagtttt 
attgtgcctt 
aggattgcat 
cctgaaagtt 
cctgacgttt 
gaccccaaac 
tcaaatgccg 
acactacata 
gaatagccaa 
actttaccag 
ctctattttt 
acttagtgac 
tctcaagcct 



agaagtttct 
tcacagcagg 
cagtgtattt 
gaaagcagaa 
atccatgaac 
tcttttttcc 
ctttgtatta 
taacaaacct 
catatggtgg 
tgttgcttat 
agtactttta 
gaggtgtttg 
gacataattg 
ttatttaaat 
ctgctgacta 
aattgctgtt 
catttggcat 
cagtgctaaa 



<210> 12 

<211> 4535 

<212> DNA 

<213> Homo sapiens 

<400> 12 

tacagatgaa ccaaggatcg ggatagcagt 
cagcaggatg ccatcaacta acagagcagg 
cttcttcaaa gaagatccag agaagctctt 
ctttggagca gtgtattttg cacgagatgt 
aatgtcttat agtggaaagc agtctactga 
gtttctacaa agaataaaac atcccaacag 
acacacagca tggcttgtaa tggaatattg 
tcacaaaaag ccattacaag aagtggaaat 
attagcctac ttacattctc atactatgat 



3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4500 

4536 



ataaaattag 
cagcctgaag 
cacagatctc 
gcgtaccaat 
gaaatggcag 
tatagaatac 
tttaggatct 
agcagcaatt 
tcatagagat 
24 



aatcaagaca 
gaccctgaaa 
agagaaattg 
gaagtggtgg 
gatattatta 
aaaggctgtt 
gcttcggatt 
acacatggtg 
atcaaagcag 



gctgactgct 

ttgcagagct 

gccatggaag 

ccatcaagaa 

aggaagtcaa 

atttacgtga 

tactagaagt 

ctcttcaggg 

gaaatatcct 
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tctgacagaa ccaggccagg 


tgaaacttgc 


tgactttggc 


tctgcttcca 


tggcatcacc 


600 


tgccaattcc 


tttgfcgggaa 


cgccgtattg 


gatggcccca 


gaagtaattt 


tagccatgga 


660 


tgaaggacaa 


tatgatggca 


aagtagatgt 


gtggtctctt 


ggaataacat 


gtattgaact 


720 


agcggaaagg 


aagcctcctt 


tatttaatat 


gaatgcaatg 


agtgccttat 


atcacatagc 


7 80 


ccaaaatgaa 


tcccctacac 


tacagtctaa 


tgaatggtct 


gattattttc 


gcaactttgt 


840 


agattcttgc 


ctccagaaaa 


tccctcaaga 


tcgacctaca 


tcagaggaac 


ttttaaagca 


900 


catatttgtt 


ettegggage 


gccctgaaac 


cgtgttaata 


gatctcattc 


agaggacaaa 


960 


ggatgcagta 


agagagctgg 


acaatctgea 


gtatcgaaag 


atgaagaaac 


tccttttcca 


1020 


ggaggcacat 


aatggaccag 


cagtagaagc 


acaggaagaa 


gaagaggaac 


aagatcatgg 


1080 


tgttggccgg acaggaacag ttaatagtgt 


tggaagtaat 


caatccattc 


ccagcatgtc 


1140 


catcagtgcc 


ageagecaaa 


gcagtagtgt 


taacagtctt 


ccagatgtct 


cagatgacaa 


1200 


gag t gage t a 


gacatgatgg 


agggagacca 


cacagtgatg 


tctaacagtt 


ctgttatcca 


1260 


tttaaaacca 


gaggaagaaa 


attacagaga 


agagggagat 


cctagaacaa 


gagcatcaga 


1320 


tccacaatct 


ccaccccaag 


tatctegtea 


caaatcacac 


tategtaate 


gagaacactt 


1380 


tgctactata -eggacagcat 


cactggttac 


gaggcaaatg 


caagaacatg 


agcaggactc 


1440 


tgagcttaga gaacaaatgt ctggctataa gcgaatgagg cgacaacatc aaaagcaact 


1500 


gatgactctg gaaaacaagc 


taaaggctga 


gatggatgaa 


catcgcctca 


gattagacaa 


1560 


agatcttgaa actcagegta aeaattttgc 


tgcagaaatg 


gagaaactta 


tcaagaaaca 


1620 


ccaggctgcc 


atggagaaag aggctaaagt 


gatgtccaat 


gaagagaaaa 


aatttcagca 


1680 


acatattcag 


gqecaacaga 


agaaagaact 


gaatagtttt 


ctcgagtccc 


agaaaagaga 


1740 


gtataaactt 


cgaaaagagc 


agcttaaaga 


ggagctaaat 


gaaaaccaga 


gtacccccaa 


1800 


aaaagaaaaa 


caggagtggc 


tttcaaagca 


gaaggagaat 


atacagcatt 


tccaagcaga 


1860 


agaagaagct 


aaccttcttc 


gaegtcaaag 


acaataccta gagctggaat 


gccgtcgctt 


1920 


caagagaaga 


atgttacttg ggcgtcataa 


cttagagcag 


gaccttgtca 


gggaggagtt 


1980 


aaacaaaaga 


cagactcaga 


aggacttaga 


geatgecatg 


ctactccgac 


ageatgaafce 


2040 


tatgeaagaa 


ctggagttcc 


. gccacctcaa 


cacaattcag aagatgeget gtgagttgat 


2100 


cagattacag 


catcaaactg 


agctcactaa 


ccagctggaa 


tataataagc 


gaagagaacg 


2160 


agaactaaga 


cgaaagcatg tcatggaagt 


tcgacaacag 


cctaagagtt 


tgaagtetaa 


2220 


agaactccaa 


ataaaaaagc 


agtttcagga 


tacctgeaaa 


atccaaacca 


gacagtacaa 


2280 


agcattaaga 


aatcacctgc 


tggagactac 


accaaagagt 


gagcacaaag 


ctgttctgaa 


2340 


aeggctcaag gaggaacaga cceggaaatt 


agctatcttg 


gctgagcagt 


atgatcacag 


2400 
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cattaatgaa 
ccaggttttg 
aatcaagatg 
ctccctccgg 
tgagcgcaca 
tgactctgaa 
ggcattcagc 
aggacctcac 
aggtggaccc 
cagtatggga 
ggagcagggc 
gtcttataca 
aactgcctac 
agtgcatatg 
ggatgtcata 
gttgatatta 
ttatgcatgt 
gaacgtagca 
aaaagatcta 
agcatatgtt 
tttgttatta 
tccatgaaca 
ctaaaggact 
gaatgcgtca 
gataagatgg 
tgagccttca 
tatgtcccgc 
ctgtgatatc 
tttgtttgta 
tgccatattt 
tatctccttt 



atgctctcca 
aagatgcagc 
caagctgagg 

agggcactct 

gaacgaatac 
agcatgagac 
cacagctacc 
tggggtcatc 
cagccatggg 
gtccgcaata 
atgagcagaa 
taacttaata 
agacatcatc 
gtatatttta 
gtacttggct 
aaaataaatg 
ggtgaaaaga 
aaatggcttc 
aagccttttt 
gaaataagtt 
aattgatctg 
cattgcttct 
tcattgattt 
taggcttgtg 
gggtacttaa 
atggggaaga 
agcagagtga 
tcatcctagc 
caaaaacaaa 
gcaaattgcc 
tggcttatgt 



cacaagccct 
tgcagcagga 
cacaacatga 
tagaacaaaa 
gaagcctgtt 
taggttttag 
cgggagcttc 
ccatgggtgg 
gtcacccttc 
gcccccaggc 
gcacgagtgt 
attgagagtg 
acagcagcct 
ttcatttttg 
gccgggtttg 
tgtatgtgtg 
attggctaga 
agttatcact 
atcccatatc 
attactggta 
acttctcagc 
cggccttttg 
tttcagagag 
agtgattttt 
ggagatcaca 
ccagaagggt 
ggagatgaag 
taagctctat 
gacatatagc 
atctattgaa 
gattttctgt 



gcgtttggat 
actggagctg 
tcgagagctt 
gattgaagaa 
ggaacgtcaa 
taatatggtc 
tggttggtca 
cccaccacaa 
agggccaatg 
tctgaggcgg 
cacttcacaa 
gcaattccgc 
cctcacttgg 
taaagcgttc 
tttgtttttg 
tacatatata 
taggggatfct 
tttgggtgtc 
ccaaattctt 
cacacctgca 
ctcatttgga 
gctaagatca 
tagaaaacaa 
gtccattcaa 
agttgtgtga 
gagaggggcc 
cttacgtgtc 
aatgcccaag 
caatacaaat 
attctcgtca 
ttacaagtag 
26 



gaagcacagg 
ttgaatgcgt 
cgcgagcttg 
gagatgttgg 
gccagagaga 
ctttctaatc 
cacaacccta 
gcttggggcc 
caaggggtac 
acagcttctg 
atatccaatg 
tggagctgtc 
gtactacagt 
tgttttgtgt 
gggaaatttt 
tacacacaca 
ttctgaacac 
tgtatcctaa 
atgagccact 
ttgcctcacc 
ctaaaaaaag 
agtgtagaaa 
cttagttttt 
ttgtgccttc 
ggattgcatt 
ctgaaagttc 
ctgacgtttt 
accccaaaca 
caaatgccgg 
cactacatag 
aatagccaat 



aagcagagtg 
atcagagcaa 
aacagagggt 
ctttgcagaa 
ttgaagcttt 
tctcccctga 
ctgggggtcc 
atccaatgca 
ctcgaggtag 
ggggacggac 
ggtcacacat 
tgccaaaaga 
gtggaagctg 
ttactaattg 
gaaaagtgga 
tacacatata 
tgcaaaaata 
gaagtttctg 
cacagcaggc 
agtgtattta 
aaagcagaaa 
tccatgaaca 
cttttttcct 
tttgtattat 
aacaaaccta 
atatggtggg 
gttgcttata 
gtacttttac 
aggtgtttga 
acataattgt 
tatttaaatg 



2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 
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3120 

3180 

3240 

3300 

3360 

3 420 

3480 

3540 
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3900 
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tttagttgcc acagtgaacc aggagtcact gagccaatga ctttaccagc tgctgactaa 4320 

tcttcatcac cactgtagat tttgctgcat gtgcaggtcc tctattttta attgctgttt 4380 

tcgttgctgc agtactttac aaacttctag ttcgttgaga cttagtgacc atttggcatc .4440 
aagttaacat cacacaatag gaaacaccac ttccacaagt ctcaagcctc agtgctaaag 



4500 



tactactgaa- aaggaactag gaagtttggc caatt 453 5 

<210> 13 

<211> 3003 

<212> DNA 

<213> Homo sapiens 

<400> 13 

atgccatcaa ctaacagagc aggcagtctt aaggaccctg aaattgcaga gctcttcttc 60 

aaagaagatc cagagaagct cttcacagat ctcagagaaa ttggccatgg aagctttgga 120 

gcagtgtatt ttgcacgaga tgtgcgtacc aatgaagtgg tggcc.atcaa gaaaatgtct 180 

tatagtggaa agcagtctac tgagaaatgg caggatatta ttaaggaagt caagtttcta 240 

caaagaataa aacateccaa cagtatagaa tacaaaggct gttatttacg tgaacacaca 300 

gcatggcttg. taatggaata ttgtttagga tctgcttcgg atttactaga agttcacaaa 360 

aagccattac aagaagtgga aatagcagca attacacatg gtgctcttca gggattagcc 420 
tacttacatt ctcatactat . gattcataga . gatateaaag caggaaatat ccttctgaca 
gaaccaggcc -aggtgaaact tgctgacfctt ggctctgctt ccatggcatc acctgccaat 

tcctttgtgg gaacgccgta ttggatggcc ccagaagtaa ttttagccat ggatgaagga 600 
caatatgatg gcaaagtaga tgtgtggtct cttggaataa catgtattga actagcggaa 
aggaagcctc ctttatttaa tatgaatgca atgagtgcct .tatatcacat agcccaaaat 

gaatcccc ta cactacagtc . taatgaatgg tctgattatt ttcgcaactc tgtagattct 780 

tgcctccaga aaatccctca agatcgacct acatcagagg aacttttaaa gcacatattt 840 

gttcttcggg agcgccctga aaccgtg^ta atagatctca ttcagaggac aaaggatgca 900 

gtaagagagc tggacaatct gcagtatcga .aagatgaaga aactcctttt ccaggaggca 960 
cataatggac cagcagtaga agcacaggaa gaagaagagg aacaagatca tggtgttggc 
cggacaggaa cagttaatag.. tgttggaagt aatcaatcca ttcccagcat gtccatcagt 

gccagcagcc aaagcagtag tgttaacagt cttccagatg tctcagatga caagagtgag 1140 

ctagacatga tggagggaga ccacacagtg atgtctaaca gttctgttat ccatttaaaa 1200 

ccagaggaag aaaattacag agaagaggga gatcctagaa caagagcatc agatccacaa 12 60 

tctccacccc aagtatctcg tcacaaatca cactatcgta atcgagaaca ctttgctact 1320 
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<210> 14 
<211> 3048 
<212> DNA 
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<213> Homo sapiens 



<400> 14 



t get cage ag 


gatgecatea 


actaacagag 


caggcagcct 


gaaggaccct 


gaaattgcag 


60 


agctcttctt 


caaagaagat 


ccagagaagc 


tcttcacaga 


tctcagagaa 


attggccatg 


120 


gaagctttgg 


agcagtgtat 


tttgeacgag 


atgtgcgtac 


caatgaagtg 


gtggccatca 


180 


agaaaatgtc 


ttatagtgga 


aagcagtcta 


ctgagaaatg 


gcaggatatt 


attaaggaag 


240 


tcaagtttct 


acaaagaata 


aaacatccca 


acagtataga 


atacaaaggc 


tgttatttac 


300 


gtgaacacac 


agcatggctt 


gtaatggaat 


attgtttagg 


atetgetteg 


gatttactag 


360 


aagttcacaa 


aaagecatta 


caagaagtgg 


aaatagcagc aattacacat ggtgctcttc 


420 


agggattagc 


ctacttacat 


tctcatacta 


tgattcatag 


agatatcaaa gcaggaaata 


480 


tccttctgac 


agaaccaggc 


caggtgaaac 


ttgetgaett tggctctget 


tccatggcat 


540 


cacctgccaa 


ttcctttgtg 


ggaacgccgt 


attggatggc 


cccagaagta 


attttageca '" 


600 


tggatgaagg 


acaatatgat 


ggcaaagtag 


atgtgtggtc 


tcttggaata 


acatgtattg 


660 


aactagegga 


aaggaagect 


cctttattta 


atatgaatgc 


aatgagtgcc 


ttatatcaca 


720 


tageccaaaa 


tgaatcccct 


acactaeagt 


ctaatgaatg 


gtctgattat 


tttegcaact 


780 


ttgtagattc 


ttgcctceag 


aaaatccctc 


aagatcgacc 


tacatcagag 


gaacttttaa 


840 


agcacatatt 


tgttcttcgg 


gagcgccctg 


aaaccgtgtt 


aatagatctc 


attcagagga 


900 


caaaggatgc 


agtaagagag ctggacaatc 


tgeagtateg 


aaagatgaag 


aaactccttt 


960 


tccaggaggc acataatgga 


ccagcagtag 


aagcacagga 


agaagaagag gaacaagatc 


1020 


atggtgttgg 


ccggacagga 


acagttaata 


gtgttggaag 


taatcaatcc 


attcccagca 


1080 


tgtccatcag 


tgccagcagc 


caaagcagta 


gtgttaacag, tcttccagat 


gtctcagatg 


1140 


acaagagtga 


gctagacatg 


atggagggag 


accacacagt 


gatgtctaac 


agttctgtta 


1200 


tccatttaaa 


accagaggaa 


gaaaattaca 


gagaagaggg 


agatcctaga 


acaagagcat 


1260 


cagatccaca atctccaccc 


caagtatctc 


gtcacaaatc 


acactatcgt 


aatcgagaac 


1320 


actttgetae 


tataeggaca 


gcatcactgg 


ttacgaggca 


aatgcaagaa 


catgagcagg 


1380 


actctgagct 


tagagaacaa 


atgtctggct 


ataagegaat 


gaggegacaa 


catcaaaagc 


1440 


aactgatgac 


tctggaaaac 


aagctaaagg 


ctgagatgga 


tgaacatege 


ctcagattag 


1500 


acaaagatct 


tgaaactcag 


cgtaacaatt 


ttgetgeaga 


aatggagaaa 


cttatcaaga 


1560 


aacaccaggc 


tgctatggag 


aaagaggcta 


aagtgatgtc 


caatgaagag 


aaaaaatttc 


1620 


agcaacatat 


tcaggcccaa 


cagaagaaag 


aactgaatag 


ttttctcgag tcccagaaaa 


1680 


gagagtataa 


acttcgaaaa 


gagcagctta 


aagaggagct 


aaatgaaaac 


cagagtaccc 


1740 


ccaaaaaaga 


aaaacaggag 


tggctttcaa 


agcagaagga 


gaatatacag 


catttccaag 


1800 
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cagaagaaga 
gcttcaagag 
agttaaacaa 
aatctatgca 
tgatcagatt 
aacgagaact 
ctaaagaact 
acaaagcatt 
tgaaacggct 
acagcattaa 
agtgccaggt 
gcaaaatcaa 
gggtctccct 
agaatgagcg 
ctfcttgactc 
ctgaggcatt 
gtccaggacc 
tgcaaggtgg 
gtagcagtat 
ggacagagca 
acatgtctta 



agctaacctt 
aagaatgtta 
aagacagact 
agaactggag 
acagcatcaa 
aagacgaaag 
ccaaataaaa 
aagaaatcac 
caaggaggaa 
tgaaatgctc 
tttgaagatg 
gatgcaagct 
ccggagggca 
cacagaacga 
tgaaagcatg 
cagccacagc 
tcactggggt 
accccagcca 
. gggagtccgc 
gggcatgagc 
tacataactt 



cttcgacgtc 
cttgggcgtc 
cagaaggact 
ttccgccacc 
actgagctca 
catgtcatgg 
aagcagtttc 
ctgctggaga 
cagacccgga 
tccacacaag 
cagctgcagc 
gaggcacaac 
ctcttagaac 
atacgaagcc 
agactaggtt 
tacccgggag 
catcccatgg 
tggggtcacc 
aatagccccc 
agaagcacga 
aataattgag 



aaagacaata 
ataacttaga 
tagagcatgc 
tcaacacaat 
ctaaccagct 
aagttcgaca 
aggatacctg 
ctacaccaaa 
aattagctat 
ccctgcgttt 
aggaactgga 
at gate gaga 
aaaagattga 
tgttggaacg 
ttagtaatat 
cttctggttg 
gtggcccacc 
cttcagggcc 
aggctctgag 
gtgtcacttc 
agtggcaatt 



cctagagctg 
gcaggacctt 
catgctactc 
tcagaagatg 
ggaatataat 
acagectaag 
caaaatccaa 
gagtgagcac 
cttggctgag 
ggatgaagca 
gctgttgaat 
gettcgegag 
agaagagatg 
teaagecaga 
ggtcctttct 
gtcacacaac 
acaagcttgg 
aatgcaaggg 
geggacaget 
acaaatatcc 
ccgctgga 



gaatgccgtc 
gtcagggagg 
cgacagcatg 
cgctgtgagt 
aagegaagag 
agtttgaagt 
accagacagt 
aaagctgttc 
cagtatgatc 
caggaagcag 
gegtatcaga 
cttgaacaga 
ttggctttgc 
gagattgaag 
aatctctccc 
cctactgggg 
ggccatccaa 
gtacctcgag 
tctgggggac 
aatgggtcac 



<210> 15 

<211> 3148 

<212> DNA 

<213> Homo sapiens 

<400> 15 

ggcacgaggg tggcgccggg eggeggggtc 
ccgcgagcag aggctgegea cagccggatc 
ceggggaaca agecacagga gagegactea 
gcggcgccgg gecegggggt ggtgacagca 
ctgaagttat aaaagagaaa agagaagttt 
tattttgtgc cacacacaaa aaggtactga 



1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3048 



ctgcgtggag 
cggcactcag 
ggaacaagtg 
ggtctgaggt 
getgetaaaa 
agatttaccc 



agtgggacgc 
cgaccggacc 
tgggagagga 
tgcatcataa 
tgaatctgag 
cccaaaaaaa 



aacgecgaga 
caaggatccg 
ageggeggeg 
atacaaagga 
caatatggaa 
attgtcaatg 



60 
120 
180 
240 
300 
360 



30 
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agaaataaag ctaactgata tcaaaaagca gagcctgctc tactggccat catgcgtaaa 42 0 

ggggtgctga aggacccaga gattgccgat ctattctaca aagatgatcc tgaggaactt 480 

tttattggtt tgcatgaaat tggacatgga agttttggag cagtttattt tgctacaaat 540 

gctcacacca gtgaggtggt ggcaattaag aagatgtcct atagtgggaa gcagacccat 600 

gagaaatggc aagatattct taaggaagtt aaatttttac gacaattgaa gcatcctaat 660 

actattgagt acaaaggctg ttacttgaaa gaacacactg cttggttggt gatggaatat 720 

tgcttaggct cagcetctga tttattagaa gttcataaaa aaccacttca ggaagtggag 7 80 

atcgctgcca 'ttactcatgg agccttgcat ggactagcct acctacattc tcatgcattg 840 

attcataggg atattaaagc aggaaatatt. cttctaacag agccaggtca ggtaaaacta 900 

gctgattttg gatctgcttc aatggcttct cctgccaact ccttcgtggg cacaccttac 960 

tggatggctc cagaggtgat cttagctatg gatgaaggac,-agtatgatgg gaaagttgat 1020 

atttggtcac. ttggcatcac . ttgtattgaa ttggcggaac ggaagccgcc ccttttcaac 1080 

atgaatgcaa tgagtgcctt atatcacatt . gcccagaatg actccccaac gttacagtct 1140 

aatgaatgga eagac tec t t taggagattt gttgattact gettgeagaa aatacctcag 1200 

gaaaggccaa catcagcaga actattaagg catgactttg ttcgacgaga ccggccacta 1260 

cgtgtcctca ttgacctcat . acagaggaca aaagatgcag ttegtgagct agataaccta 1320 

cagtaccgaa aaatgaaaaa aatacttttc caagagacac ggaatggacc cttgaatgag 13 80 

tcacaggagg atgaggaaga .cagtgaacat ggaaccagcc tgaacaggga aatggacagc 1440 

ctgggcagca accattccat tccaagcatg teegtgagea caggcagcca gagcagcagt 1500 

gtgaacagca tgcaggaagt catggacgag agcagttccg aacttgtcat gatgeacgat 1560 

gacgaaagca caatcaattc cagc.tcctcc gtcgtgcata agaaagatca t:gtattcata 1620 

agggatgagg cgggccacgg cgatcccagg cctgagccgc ggcctaccca gtcagttcag 1680 

agccaggccc tccaetaccg gaacagagag cgctttgcca cgatcaaatc agcatctttg 1740 

gttacacgac agatccatga gcatgagcag gagaacgagt tgegggaaca gatgtcaggt 1800 

tataagegga tgcggcgcca gcaccagaag cagctgatcg ccctggagaa caagctgaag 1860 

gctgagatgg acgagcaccg cctcaagcta cagaaggagg tggagacgea tgccaacaac 192 0 

tcgtccatcg ,agctggagaa gctggccaag aagcaagtgg ctatcataga aaaggaggca 1980 

aaggtagctg cagcagatga gaagaagttc cagcaacaga tcttggccca gcagaagaaa 2040 

gatttgacaa nctttcttaga aagtcagaag aagcagtata agatttgtaa ggaaaaaata 2100 

aaagaggaaa tgaatgagga ccatagcaca cccaagaaag agaagcaaga gcggatctcc 2160 

aaacataaag agaacttgea gcacacacag gctgaagagg aagcccacct tctcactcaa 2220 
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cagagactgt 
cacgaggtgg 
atggagcatg 
ctgcacacgt 
gaaaaccagc 
gaacttcggc 
caggacactt 
gttactccaa 
aaacttgcca 
gcgttacggc 
caggaaatgg 
catgaacgtg 
cagaagattg 
ctattggaaa 
tttgggaatt 
tgccatttac 



actacgacaa 
agcagcagaa 
ccatgctaat 
tacagaagct 
tggagtacaa 
aacagccaaa 
gcaaagtaca 
agaatgagca 
ttttggcaga 
tagatgaggc 
agctgctcaa 
agctccagaa 
aagaggagct 
ggcaagagcg 
tggttacatt 
aaaaaaaaaa 



aaattgtcgt 
cattcgggag 
ccggcacgac 
acgcatggat 
taagaggcga 
aaacttaaag 
gaccaaacag 
caaaacaatc 
gcagtatgaa 
tcaagaagca 
cgcctaccag 
gctagagcag 
ggctgccctt 
agagattgaa 
agattttcct 
aaaaaaaa 



ttcttcaagc 
gaactaaata 
gagtccaccc 
ctgatccgtt 
gaaagagaac 
gccatggaaa 
tataaagcac 
ttaaagacac 
cagagtataa 
gaatgccagg 
agcaaaatca 
agagtgtctc 
cagaaggaac 
acttttgaca 
aaggaggact 



ggaaaataat 
aaaagaggac 
gagagctaga 
tacagcacca 
tgcacagaaa 
tgcaaattaa 
tcaagaatca 
tgaaagatga 
atgaaatgat 
ccttgaggct 
agatgcaaac 
tgcgcagagc 
gcagcgagag 
tggagagcct 
acagatgaga 



gatcaagcgg 
ccagaaggag 
gtacaggcag 
gacggaactg 
gcatgtcatg 
aaaacagttt 
ccagttggaa 
gcagacaaga 
ggcctctcaa 
acagctccag 
agaggcacaa 
acaccttgag 
aataaagaac 
cagaatggga 
ttaaattttt 



2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3148 



<210> 16 

<211> 1049 

<212> PRT 

<213> Homo sapiens 

<400> 16 

Ala Gly Gly Arg Ala Gly Ser Leu Lys Asp Pro Asp Val Ala 



Met Pro 

1 5 



10 



Glu Leu Phe Phe Lys Asp Asp Pro Glu Lys Leu Phe Ser Asp Leu Arg 
20 25 30 



Glu lie Gly His 
35 



Gly Ser Phe Gly Ala Val Tyr Phe Ala Arg Asp Val 



40 



45 



Arg Asn Ser Glu Val Val Ala He Lys Lys Met Ser Tyr Ser Gly Lys 



50 



55 



Gin Ser Asn Glu Lys Trp Gin Asp He He Lys Glu Val Arg Phe Leu 
65 70 75 

Gin Lys Leu Arg His Pro Asn Thr He Gin Tyr Arg Gly Cys Tyr Leu 
85 90 
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Arg Glu His Thr Ala Trp Leu Val Met Glu Tyr Cys Leu Gly Ser Ala 
100 105 HO 

Ser Asp Leu Leu Glu Val His Lys Lys Pro Leu Gin Glu Val Glu lie 
115 120 125 

Ala Ala Val Thr His Gly Ala Leu Gin. Gly Leu Ala Tyr Leu His Ser 
130 135 140 

His Asn Met He His Arg Asp Val Lys Ala Gly Asn He Leu Leu Ser 
145 150 155 160 

Glu Pro Gly Leu Val Lys Leu Gly Asp Phe Gly Ser Ala Ser He Met 
165 170 175 

Ala Pro Ala Asn Ser Phe Val Gly Thr Pro Tyr Trp Met Ala Pro Glu 
180 185 _ 190 

Val He Leu Ala Met Asp Glu Gly Gin Tyr Asp Gly Lys Val Asp Val 
195 200 205 

Trp Ser Leu Gly He Thr Cys He Glu Leu Ala Glu Arg Lys Pro Pro 
210 215 220 

Leu Phe Asn Met Asn Ala Met Ser Ala Leu Tyr His He Ala Gin Asn 
225 230 235 240 

Glu Ser Pro Val Leu Gin Ser Gly His Trp Ser Glu Tyr Phe Arg Asn 
245 250 255 

Phe Val Asp Ser Cys Leu Gin Lys He Pro Gin Asp Arg Pro Thr Ser 
260 ~ 265 270 

Glu Val Leu Leu Lys His Arg Phe . Val Leu Arg Glu Arg Pro Pro Thr 
275 280 285 

Val He Met Asp Leu He Gin Arg Thr Lys Asp Ala Val Arg Glu Leu 
290 295 300 

Asp Asn Leu Gin Tyr Arg Lys Met Lys Lys He Leu Phe Gin Glu Ala 
305 310 315 320 

Pro Asn Gly Pro Gly Ala Glu Ala Pro Glu Glu Glu Glu Glu Ala Glu 
325 330 335 
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Pro Tyr Met His Arg Ala Gly Thr Leu Thr Ser Leu Glu Ser Ser His 
340 345 



Ser Val Pro Ser Met Ser He Ser Ala Ser Ser Gin Ser Ser Ser Val 
355 



360 365 



Asn Ser Leu 
370 



Ala Asp Ala Ser Asp Asn Glu Glu Glu Glu Glu Glu Glu 



375 38° 



Glu Glu Glu Glu Glu Glu Glu Glu Gly Pro Glu Ala Arg Glu Met Ala 
385 390 395 

Met Met Gin Glu Gly Glu His Thr Val Thr Ser His Ser Ser lie He 
405 410 

His Arg Leu Pro Gly Ser Asp Asn Leu Tyr Asp Asp Pro Tyr Gin Pro 
420 425 

Glu lie Thr Pro Ser Pro Leu Gin Pro Pro Ala Ala Pro Ala Pro Thr 



435 



440 445 



Ser Thr Thr Ser Ser Ala Arg Arg Arg Ala Tyr Cys Arg Asn Arg Asp 
450 



~\ '~' 455 460 



His Phe Ala Thr He Arg Thr Ala Ser Leu Val Ser Arg Gin He Gin 
465 470 4 

Glu His Glu Gin Asp Ser Ala Leu Arg Glu Gin Leu Ser Gly Tyr Lys 

,485 49 ~ 

Arg Met Arg Arg Gin His Gin Lys Gin Leu Leu Ala Leu Glu Ser Arg 
500 505 

Leu Arg Gly Glu Arg Glu Glu His Ser Ala Arg Leu Gin Arg Glu Leu 
515 520 525 

Ala Gin Arg Ala Gly Phe Gly Ala Glu Ala Glu Lys Leu Ala Arg 



Glu 

530 



535 540 



Arg His Gin Ala He Gly Glu Lys Glu Ala Arg Ala Ala Gin Ala Glu 
545 550 555 

Glu Arg Lys Phe Gin Gin His He Leu Gly Gin Gin Lys Lys Glu Leu 
565 570 3 



Ala Ala Leu Leu Glu Ala Gin Lys Arg Thr Tyr Lys Leu Arg Lys Glu 
580 585 590 
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Gin Leu Lys Glu Glu Leu Gin Glu Asn Pro Ser Thr Pro Lys Arg Glu 
595 600 605 



Lys Ala Glu Trp Leu Leu Arg Gin Lys Glu Gin Leu Gin Gin Cys Gin 
610 615 620 



Ala Glu Glu Glu Ala Gly Leu Leu Arg Arg Gin Arg Gin Tyr Phe Glu 
625 630 635 640 



Leu Gin Cys Arg Gin Tyr Lys Arg Lys Met Leu Leu Ala Arg His Ser 
645 650 655 



Leu Asp Gin Asp Leu Leu Arg Glu Asp Leu Asn Lys Lys Gin Thr Gin 
660 665 670 



Lys Asp Leu Glu Cys Ala Leu Leu Leu Arg Gin His Glu Ala Thr Arg 
675 680 685 



Glu Leu Glu Leu Arg Gin Leu Gin Ala Val Gin Arg Thr Arg Ala Glu 
690 695 700 



Leu Thr Arg Leu Gin His Gin Thr Glu Leu Gly Asn Gin Leu Glu Tyr 
705 710 715 720 



Asn Lys Arg Arg Glu Gin Glu Leu Arg Gin Lys His Ala Ala Gin Val 
725 730 735 



Arg Gin Gin Pro Lys Ser Leu Lys Ser Lys Glu Leu Gin He Lys Lys 
740 745 750 



Gin Phe Gin Glu Thr Cys Lys lie Gin Thr Arg Gin Tyr Lys Ala Leu 
755 760 765 



Arg Ala His Leu Leu Glu Thr Thr Pro Lys Ala Gin His Lys Ser Leu 
770 775 780 



Leu Lys Arg Leu Lys Glu Glu Gin Thr Arg Lys Leu Ala He Leu Ala 
785 790 795 800 



Glu Gin Tyr Asp Gin Ser He Ser Glu Met Leu Ser Ser Gin Ala Leu 
805 810 815 



Arg Leu Asp Glu Thr Gin Glu Ala Glu Phe Gin Ala Leu Arg Gin Gin 
820 825 830 
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Leu Gin Gin Glu Leu Glu Leu Leu Asn Ala Tyr Gin Ser Lys lie Lys 
835 840 845 

lie Arg Thr Glu Ser Gin His Glu Arg Glu Leu Arg Glu Leu Glu Gin 
850 855 860 

Arg Val Ala Leu Arg Arg Ala Leu Leu Glu Gin Arg Val Glu Glu Glu 

865 870 875 880 

Leu Leu Ala Leu Gin Thr Gly Arg Ser Glu Arg lie Arg Ser Leu Leu 



885 



890 895 



Glu Arg Gin Ala Arg Glu He Glu Ala Phe Asp Ala Glu Ser Met Arg 
900 905 910 

Leu Gly Phe Ser Ser Met Ala Leu Gly Gly He Pro Ala Glu Ala Ala 
915 920 925 

Ala Gin Gly Tyr Pro Ala Pro Pro Pro Ala Pro Ala Trp Pro Ser Arg 
930 935 940 

Pro Val Pro Arg Ser Gly Ala His Trp Ser His Gly Pro Pro Pro Pro 
945 950 955 960 

Gly Met Pro Pro Pro Ala Trp Arg Gin Pro Ser Leu Leu Ala Pro Pro 
965 970 975 

Gly Pro Pro Asn Trp Leu Gly Pro Pro Thr Gin Ser Gly Thr Pro Arg 
980 985 990 

Gly Gly Ala Leu Leu Leu Leu Arg Asn Ser Pro Gin Pro Leu Arg Arg 
995 1000 1005 

Ala Ala Ser Gly Gly Ser Gly Ser Glu Asn Val Gly Pro Pro Ala 
1010 1015 1020 

Ala Ala Val Pro Gly Pro Leu Ser Arg Ser Thr Ser Val Ala Ser 
1025 1030 1035 

His He Leu Asn Gly Ser Ser His Phe Tyr Ser 
1040 1045 



<210> 17 

<211> 898 

<212> PRT 

<213> Homo sapiens 

<400> 17 
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Met Arg Lys Gly Val Leu Lys Asp Pro Glu He Ala Asp Leu Phe Tyr 

1 ~~ ~ 5 10 .15 



Lys Asp Asp Pro Glu Glu Leu Phe He Gly Leu His Glu lie Gly His 
20 25 30 



Gly Ser Phe Gly Ala Val Tyr Phe Ala Thr Asn Ala His Thr Ser Glu 
35 40 45 



Val Val Ala He Lys Lys Met Ser Tyr Ser Gly Lys Gin Thr His Glu 
50 55 60 



Lys Trp Gin Asp lie Leu Lys Glu Val Lys Phe Leu Arg Gin Leu Lys 
65 . ^ 70 75 80 



His Pro Asn Thr He Glu Tyr Lys Gly Cys Tyr. Leu Lys Glu His Thr 
85 90 95 



Ala Trp Leu Val Met Glu Tyr Cys Leu Gly Ser Ala Ser Asp Leu Leu 
100 105 110 



Glu Val His Lys Lys Pro Leu Gin Glu Val Glu He Ala Ala He Thr 
115 120 125 



His Gly Ala Leu His Gly Leu Ala Tyr Leu His Ser His Ala Leu He 
130 135 140 



His Arg Asp He Lys Ala Gly Asn He Leu Leu Thr Glu Pro Gly Gin 
145 ~ 150 . 155 160 



Val Lys Leu Ala Asp Phe Gly Ser Ala Ser Met Ala Ser Pro Ala Asn 
165 170 175 



Ser Phe Val Gly Thr Pro Tyr Trp Met Ala Pro Glu Val He Leu Ala 
180 185 190 



Met Asp Glu Gly Gin Tyr Asp Gly Lys Val Asp He Trp Ser Leu Gly 
195 200 205 



He Thr Cys He Glu Leu Ala Glu Arg Lys Pro Pro Leu Phe Asn Met 
210 215 220 



Asn Ala Met Ser Ala Leu Tyr His He Ala Gin Asn Asp Ser Pro Thr 
225 230 235 240 



Leu Gin Ser Asn Glu Trp Thr Asp Ser Phe Arg Arg Phe Val Asp Tyr 
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245 * 250 255 

Cys Leu Gin Lys lie Pro Gin Glu Arg Pro Thr Ser Ala Glu Leu Leu 
260 265 270 

Arg His Asp Phe Val Arg Arg Asp Arg Pro Leu Arg Val Leu lie Asp 
275 280 285 

Leu lie Gin Arg Thr Lys Asp Ala Val Arg Glu Leu Asp Asn Leu Gin 
290 ~ 295 300 

Tyr Arg Lys Met Lys Lys He Leu Phe Gin Glu Thr Arg Asn Gly Pro 
305 310 315 320 

Leu Asn Glu Ser Gin Glu Asp Glu Glu Asp Ser Glu His Gly Thr Ser 
325 330 335 

Leu Asn Arg Glu Met Asp Ser Leu Gly Ser Asn His Ser He Pro Ser 
340 345 350 

Met Ser Val Thr Trp Asn Gin Pro Glu Gin Gly Asn Gly Gin Pro Gly 
355 360 365 

Gin Gin Pro Phe His Ser Lys His Val Arg Val Met Met His Asp Asp 
370 375 380 

Glu Ser Thr He Asn Ser Ser Ser Ser Val Val His Lys Lys Asp His 
385 390 395 40C 

Val Phe He Arg Asp Glu Ala Gly His Gly Asp Pro Arg Pro Glu Pro 
405 410 415 

Arg Pro Thr Gin Ser Val Gin Ser Gin Ala Leu His Tyr Arg Asn Arg 
420 425 430 

Glu Arg Phe Ala Thr He Lys Ser Ala Ser Leu Val Thr Arg Gin He 
435 440 445 

His Glu His Glu Gin Glu Asn Glu Leu Arg Glu Gin Met Ser Gly Tyr 
450 455 460 

Lys Arg Met Arg Arg Gin His Gin Lys Gin Leu He Ala Leu Glu Asn 
465 ~ 470 475 480 

Lys Leu Lys Ala Glu Met Asp Glu His Arg Leu Lys Leu Gin Lys Glu 
485 490 495 
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Val Glu Thr His Ala Asn Asn Ser Ser lie Glu Leu Glu Lys Leu Ala 

500 505 510 

Lys Lys Gin Val Ala He He Glu Lys Glu Ala Lys Val Ala Ala Ala 

515 520 525 



Asp Glu Lys Lys Phe Gin Gin Gin He Leu Ala Gin Gin Lys Lys Asp 
530 535 540 



Leu Thr Thr Phe Leu Glu Ser Gin Lys Lys Gin Tyr Lys He Cys Lys 
545 550 555 560 



Glu Lys He Lys Glu Glu Met Asn Glu Asp His Ser Thr Pro Lys Lys 
565 570 575 



Glu Lys Gin Glu Arg He Ser Lys His Lys Glu Asn Leu Gin His Thr 
580 585 590 



Gin Ala Glu Glu Glu Ala His Leu Leu Thr Gin Gin Arg Leu Tyr Tyr 
595 600 605 

Asp Lys Asn Cys Arg Phe Phe Lys Arg Lys He Met He Lys Arg His 
610 615 620 

Glu Val Glu Gin Gin Asn He Arg Glu Glu Leu Asn Lys Lys Arg Thr 
625 630 635 640 

Gin Lys Glu Met Glu His Ala Met Leu He Arg His Asp Glu Ser Thr 
645 650 655 



Arg Glu Leu Glu Tyr Arg Gin Leu His Thr Leu Gin Lys Leu Arg Met 
660 665 670 



Asp Leu He Arg Leu Gin His Gin Thr Glu Leu Glu Asn Gin Leu Glu 
675 680 685 



Tyr Asn Lys Arg Arg Glu Arg Glu Leu His Arg Lys His Val Met Glu 
690 695 700 



Leu Arg Gin Gin Pro Lys Asn Leu 
705 710 



Lys Gin Phe Gin Asp Thr Cys Lys 
725 



Leu Lys Asn His Gin Leu Glu Val 



Lys Ala Met Glu Met Gin He Lys 
715 720 

Val Gin Thr Lys Gin Tyr Lys Ala 
730 735 

Thr Pro Lys Asn Glu His Lys Thr 
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740 



745 750 



lie Leu Lys Thr Leu Lys Asp Glu Gin Thr Arg Lys Leu Ala lie Leu 
755 760 765 

Ala Glu Gin Tyr Glu Gin Ser lie Asn Glu Met Met Ala Ser Gin Ala 
770 775 780 

Leu Arg Leu Asp Glu Ala Gin Glu Ala Glu Cys Gin Ala Leu Arg Leu 
785 790 795 800 

Gin Leu' Gin Gin Glu Met Glu Leu Leu Asn Ala Tyr Gin Ser Lys lie 
805 810 815 

Lys Met Gin Thr Glu Ala Gin His Glu Arg Glu Leu Gin Lys Leu Glu 
820 825 830 

Gin Arg Val Ser Leu Arg Arg Ala His Leu Glu Gin Lys lie -Glu Glu 
835 840 845 

Glu Leu Ala Ala Leu Gin Lys Glu Arg Ser Glu Arg lie Lys Asn Leu 
850 855 860 

Leu Glu Arg Gin Glu Arg Glu lie Glu Thr Phe Asp Met Glu Ser Leu 
865 870 875 880 

Arg Met Gly Phe Gly Asn Leu Val Thr Leu Asp Phe Pro Lys Glu Asp 
885 890 895 



Tyr Arg 



<210> 18 

<211> 1005 

<212> PRT 

<213> Homo sapiens 

<400> 18 

Leu Leu Ser Arg Met Pro Ser Thr Asn Arg Ala Gly Ser Leu Lys Asp 
1 5 10 15 

Pro Glu lie Ala Glu Leu Phe Phe Lys Glu Asp Pro Glu Lys Leu Phe 
20 25 30 

Thr Asp Leu Arg Glu He Gly His Gly Ser Phe Gly Ala Val Tyr Phe 
35 40 45 
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Ala Arg Asp Val Arg Thr Asn Glu Val Val Ala lie Lys Lys Met Ser 
50 55 60 



Tyr Ser Gly Lys Gin Ser Thr Glu Lys Trp Gin Asp lie lie Lys Glu 
65 70 7 5 80 



Val Lys Phe Leu Gin Arg He Lys His Pro Asn Ser He Glu Tyr Lys 
85 90 . 95 



Gly Cys Tyr Leu Arg Glu His Thr Ala Trp Leu Val Met Glu Tyr Cys 
100 105 110 



Leu Gly Ser Ala Ser Asp Leu Leu Glu Val His Lys Lys Pro Leu Gin 
115 120 125 



Glu Val Glu He Ala Ala He Thr His Gly Ala Leu Gin Gly Leu Ala 
130 135 140 



Tyr Leu His Ser His Thr Met He His Arg Asp He Lys Ala Gly Asn 
145 150 155 160 



He Leu Leu Thr Glu Pro Gly Gin Val Lys Leu Ala Asp Phe Gly Ser 
165 170 175 



Ala Ser Met Ala Ser Pro Ala Asn Ser Phe Val Gly Thr Pro Tyr Trp 
180 185 190 



Met Ala Pro Glu Val He Leu Ala Met Asp Glu Gly Gin Tyr Asp Gly 
195 200 205 



Lys Val Asp Val Trp Ser Leu Gly He Thr Cys He Glu Leu Ala Glu 
210 215 220 



Arg Lys Pro Pro Leu Phe Asn Met Asn Ala Met Ser Ala Leu Tyr His 
225 230 235 240 



He Ala Gin Asn Glu Ser Pro Thr 
245 

Tyr Phe Arg Asn Phe Val Asp Ser 
260 

Arg Pro Thr Ser Glu Glu Leu Leu 
275 280 

Arg Pro Glu Thr Val Leu He Asp 
290 295 



Leu Gin Ser Asn Glu Trp Ser Asp 
250 255 



Cys Leu Gin Lys He Pro Gin Asp 
265 270 



Lys His He Phe Val Leu Arg Glu 
285 



Leu He Gin Arg Thr Lys Asp Ala 
300 
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Glu Leu Asp Asn Leu Gin Tyr Arg Lys Me, Lys Lys Leu Leu 

Gly Pro Ala Val Glu Ala Gin Glu Glu Glu 

Thr Gly Thr Val Asn Ser Val 
3 50 

He Ser Ala Ser Ser Gin 
365 

Val Ser Asp Asp Lys Ser Glu 
380 

Olu Gly Asp His Thr Val Met Ser Asn Ser Ser Val 



Val Arg Glu Leu Asp — 315 

305 31 



PHe Gin Glu Ala His Asn Gly Pro — v~ 
325 



Glu Glu Gin Asp His Gly val Gly Arg Tnr w 
340 

Gly Ser Asn Gin Ser He Pro Ser Met Ser ^ 
355 

Ser Ser Ser Val Asn Ser Leu Pro Asp ^ 
370 



Leu Asp Met Met Glu Gxy ^ — — 395 
385 39 



Ile - «. « -» -» - - SS " 9 Glu Glu G1Y 5S "° 

405 

, ln e er pro Pro Gin Val Ser Arg His 
Arg Thr Arg Ala Ser Asp Pro Gin Ser Pro ^ 
420 

* „ n u His Phe Ala Thr He Arg Thr Ala 
Lys ser His Tyr Arg Asn Arg Glu Hxs Ph ^ 
435 



465 



MU « ThX „ - ~ - - ~ «» S ^ ^ 
450 455 

_ _ Met Arg Arg Gin His Gin Lys 

47 0 



A rg Glu Gin Met Ser Gly Tyr Lys Arg Met ~» — - 



T .=„ tvs Ala Glu Met Asp Glu His 
; in Leu Met Thr Leu Glu Asn Lys Leu Lys Ala ^ 
485 

.„„ t ,_ u giu Thr Gin Arg Asn Asn Phe Ala 
*rg Leu Arg Leu Asp Lys Asp Leu Glu T 51Q 
500 

r rio Lvs Lys His Gin Ala Ala Met Glu Lys 
Ala Glu Met Glu Lys Leu Ile Lys Lys H ^ 

„ , Met Ser Asn Glu Glu Lys Lys Phe Gin Gin His He 
Ala Lys Val Met Ser Asn w-l" 54Q 



Glu ■ — 535 

530 5J:> 
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Gin Ala Gin Gin Lys Lys Glu Leu Asn Ser Phe Leu Glu Ser Gin Lys 
545 550 555 560 



Arg Glu Tyr Lys Leu Arg Lys Glu Gin Leu Lys Glu Glu Leu Asn Glu 
565 57 0 57 5 



Asn Gin Ser Thr Pro Lys Lys Glu Lys Gin Glu Trp Leu Ser Lys Gin 
580 585 590 



Lys Glu Asn He Gin His Phe Gin Ala Glu Glu Glu Ala Asn Leu Leu 
595 600 605 



Arg Arg Gin Arg Gin Tyr Leu Glu Leu Glu Cys Arg Arg Phe Lys Arg 
610 615 620 



Arg Met Leu Leu Gly Arg His Asn Leu Glu Gin Asp Leu Val Arg Glu 
625 630 635 640 



Glu Leu Asn Lys Arg Gin Thr Gin Lys Asp Leu Glu His Ala Met Leu 
645 650 655 



Leu Arg Gin His Glu Ser Met Gin Glu Leu Glu Phe Arg His Leu Asn 
660 665 670 



Thr lie Gin Lys Met Arg Cys Glu Leu He Arg Leu Gin His Gin Thr 
675 680 685 



Glu Leu Thr Asn Gin Leu Glu Tyr Asn Lys Arg Arg Glu Arg Glu Leu 
690 695 700 



Arg Arg Lys His Val Met Glu Val Arg Gin Gin Pro Lys Ser Leu Lys 
705 710 715 720 



Ser Lys Glu Leu Gin lie Lys Lys Gin Phe Gin Asp Thr Cys Lys He 
725 730 735 



Gin Thr Arg Gin Tyr Lys Ala Leu Arg Asn His Leu Leu Glu Thr Thr 
740 745 750 



Pro Lys Ser Glu His Lys Ala Val Leu Lys Arg Leu Lys Glu Glu Gin 
755 760 765 



Thr Arg Lys Leu Ala He Leu Ala Glu Gin Tyr Asp His Ser He Asn 
770 775 780 



Glu Met Leu Ser Thr Gin Ala Leu Arg Leu Asp Glu Ala Gin Glu Ala 
785 790 795 800 
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Glu Cys Gin Val Leu Lys Met Gin Leu Gin Gin Glu Leu Glu Leu Leu 
805 810 815 

Asn Ala Tyr Gin Ser Lys He Lys Met Gin Ala Glu Ala Gin His Asp 
820 825 830 

Arg Glu Leu Arg Glu Leu Glu Gin Arg Val Ser Leu Arg Arg Ala Leu 
835 ~ 840 845 

Leu Glu Gin Lys He Glu Glu Glu Met. Leu Ala Leu Gin Asn Glu Arg 
850 855 860 

Thr Glu Arg He Arg Ser Leu Leu Glu Arg Gin Ala Arg Glu He Glu 
865 870 875 880 

Ala Phe Asp Ser Glu Ser Met Arg Leu Gly Phe Ser Asn Met Val Leu 
885 890 - 895 

Ser Asn Leu Ser Pro Glu Ala Phe Ser His Ser Tyr Pro Gly Ala Ser 
900 905 910 

Gly Trp Ser His Asn Pro Thr Gly Gly Pro Gly Pro His Trp Gly His 
915 920 925 

Pro Met Gly Gly Pro Pro Gin Ala Trp Gly His Pro Met Gin Gly Gly 
930 " 935 940 

Pro Gin Pro Trp Gly His Pro Ser Gly Pro Met Gin Gly Val Pro Arg 
945 ~ 950 955 960 

Gly Ser Ser Met Gly Val Arg Asn Ser Pro Gin Ala Leu Arg Arg Thr 
965 970 975 

Ala Ser Gly Gly Arg Thr Glu Gin Gly Met Ser Arg Ser Thr Ser Val 
980 985 990 

Thr Ser Gin He Ser Asn Gly Ser His Met Ser Tyr Thr 
995 1000 1005 
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